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PREFACE 


In the 1970s, there were few graph theorists among the 5000 members of the 
Japan Mathematics Society. Graph Theory was not even considered a suitable 
area in which to write a Ph.D. dissertation. 

The situation has changed over the last few years; through the efforts of a small 
but determined group of mathematicians, Graph Theory has flourished in Japan 
and Japanese graph theorists have received worldwide acceptance. The phenom- 
enal success of the First Japan Conference of Graph Theory and Applications 
held in Hakone, Japan, June 1-5, 1986 is an affirmation of this acceptance. 

The conference was attended by 200 participants representing 22 countries. 
This volume contains the Proceedings of that conference. 

The overwhelming response we have received from graph theorists all over the 
world encourages us to plan a Second Conference in Hakone, to precede the 
IMU Conference, scheduled to be held in Kyoto, Japan in 1990. 


Jin Akiyama 
Yoshimi Egawa 
Hikoe Enomoto 


for 


the Organizing Committee 


Acknowledgements 


Many people contributed to the success of the First Conference and the 
preparation of these Proceedings. We are grateful to the speakers, participants, 
and sponsors of the conference, as well as the referees for these Proceedings. In 
particular, we would like to thank: 


Paul Erdos and Ronald L. Graham 
for sowing the seeds of mathematical discovery among Japanese high school 
students 


D. Avis, V. Chvatal, M. Deza and P. Frankl 
whose annual visists to Japan inspire and encourage many young Japanese graph 
theorists 


Shigeru Katsumata, Mayor of Hakone and the People of Hakone 
for their hospitality and their colorful cultural presentations 


0012-365X/88/$3.50 © 1988, Elsevier Science Publishers B.V. (North-Holland) 


2 Preface 


Peter Hammer and the staff of Discrete Mathematics 
for their support during the preparation of these Proceedings 


Mari-Jo P. Ruiz 
for her continuing work in editing our Japanese English 


Lucia Hayakawa 
for initiating the conference participants into the ceremonies of the Japanese way 
of life 


Sundia Preparatory School 
for its generous support and the use of Hakone Seminar House 


Our stadents 
who worked tirelessly as our conference assistants. 


Discrete Mathematics 72 (1988) 3-13 3 
North-Holland 


HIGHLY IRREGULAR m-CHROMATIC GRAPHS 


Yousef ALAVI* 
Western Michigan University, Kalarnazoo, MI 49008, U.S.A. 


Fred BUCKLEY and Marc SHAMULA 
Baruch College (CUNY), New York, NY 10010, U.S.A. 


Sergio RUIZ** 


Universidad Catolica de Valparaiso, Valparaiso, Chile 


Received August 28, 1986 
Revised September 16, 1987 


A graph is highly irregular if it is connected and the neighbors of each vertex have distinct 
degrees. In this paper, we study existence and extremal problems for highly irregular graphs 
with a given maximum degree and focus our attention on highly irregular graphs that are 
m-chromatic for m 2 2. 


1. Introduction 


One of the most widely studied classes of graphs are regular graphs. They show 
up in many contexts. For example, r-regular graphs with connectivity and 
edge-connectivity equal to r play a key role in designing reliable communication 
networks [2]. In the study of line graphs, Ray-Chaudhuri [6] showed that L(G) is 
regular if and only if G is a regular graph or a semi-regular bipartite graph. 
Cages, which are minimum order r-regular graphs with girth g, have been widely 
studied—see the survey by Wong [7]. Godsil and McKay [4] examined graphs 
with the property that for each vertex u, the subgraph induced by the neighbors 
of v is regular and the subgraph induced by the nonneighbors of v is regular. 
They showed that such graphs are semi-regular (there are only two degrees) and 
the subgraph induced by the vertices of each degree are regular. 

In studying regular graphs, neighbors and neighborhoods often play a vital 
role. Indeed, a connected r-regular graph can be defined as a connected graph for 
which the neighbors of each vertex all have the same degree r. We study graphs 
at the opposite extreme from regular graphs. In [1] a connected graph is defined 
to be highly irregular if for each vertex v, all the neighbors of uv have distinct 
degrees. In Fig. 1, we display all the highly irregular graphs of order at most 8. Of 
the seven graphs in Fig. 1, all but one is bipartite. This very high proportion of 
bipartite graphs does not continue. For example, of the 16 highly irregular graphs 


* Research supported in part by a Western Michigan University Faculty Research Fellowship. 
** Research supported by the United Nations through the program PNUD-UNESCO, and the 
Universidad Catolica de Valparaiso (Chile). 
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° o——————-0 o_O o——<—1r0 


Fig. 1. The highly irregular graphs with at most eight vertices. 


of order 9 or 10. eight are bipartite. When we refer to a A-graph (or A-tree) we 
mean a graph with maximum degree A. We shall first examine existence and 
extremal problems for cycles in highly irregular graphs, focusing on bipartite 
graphs. 


2. Highly irregular trees 


In this section we determine the values of n for which there is a highly irregular 
tree of order n. Several preliminary observations from [1] will be useful. 


Remark 1. A vertex of degree A in a highly irregular A-graph is adjacent to 
exactly one vertex of each degree k, 1S k <A. 


It follows easily from Remark 1 that a highly irregular A-graph has order at 
least 2. A most useful property concerning the maximum degree A is the 
following. 


Remark 2. Let G be a highly irregular A-graph on at least four vertices. Then G 
contains an induced path P, whose internal vertices have degree A in G and 
endpoints have degree 1 in G. 


In what follows, we consider our (nontrivial) graphs to be doubly rooted at 
some pair of vertices u and uv of degree A. We note that 4-trees are especially 
important in chemical applications [5]. Let T be a highly irregular A-tree of 
minimum order. Then the maximal subtree 7, containing u but not v clearly has 
the same order as the maximal subtree 7, containing v but not u (see Fig. 2). The 
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u Vv 


T Ty Ty 


Fig. 2. Highly irregular 4-tree T with its subtrees 7, and T,. 


tree T has only two vertices of degree A. Otherwise, suppose T,, has a vertex of 
degree A, and let x be the nearest such vertex to u. Let y be the neighbor of x on 
the unique path from u to x. Then degy<A-—1 and the maximal subtree 
containing x and not y can be replaced by a tree of smaller order so that 
deg x < A—1. Then T is not minimun, a contradiction. 


Theorem A. The order of a highly irregular A-tree is at least 2°. 


Highly irregular graphs were shown in [1] to exist for all orders except n = 3, 5, 
and 7. We shall give the corresponding results for trees. First, we need two 
lemmas. Since each vertex of degree A has exactly one neighbor of each degree 
k, 1<k <A, maximum degree vertices occur in pairs. 


Lemma 1. In any nontrivial highly irregular A-graph, there is an even number of 
vertices of degree A. 


An easy observation is the following. 
Lemma 2. The highly irregular A-tree of order 2° is unique. 


Theorem 1. There exists a highly irregular tree for every order except n =3, 5, 6, 
7, 11, 12, and 13. 


Proof. The result for n <8 follows from Fig. 1. To obtain a highly irregular tree 
of order 9, subdivide one of the edges joining a vertex of degree 2 with a vertex 
of degree 1 in the last graph of Fig. 1. For the tree of order 10, subdivide the 
other such edge as well. 

Next we show that orders 11, 12, and 13 are not possible. By Theorem A, A is 
at most 3 for these values of n. In the tree of order 10, no additional subdivisions 
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are possible without losing the highly irregular property. Thus, when 11 <n <15, 
a highly irregular tree of order n must have at least three vertices of degree 3, and 
by Lemma 1 it must have at least four such vertices. Also, for T to be highly 
irregular, there must be a ‘buffer’ of two degree 2 vertices between a pair of 
nonadjacent degree 3 vertices. This forces at least the graph in Fig. 3. Thus there 
are no highly irregular trees of order 11, 12, or 13. Fig. 3 shows the tree of order 
14, and a tree of order 15 is obtained by subdividing edge xy. 

Let n = 16, let A= |log, n], and let x =n — 24. To construct a highly irregular 
tree T* of order n, begin with the unique highly irregular A-tree T of order 24 
doubly rooted at u and v. Note that the number x of additional vertices needed to 
produce 7* is less than 2*. Let u, and v, be the neighbors of u and v with degree 
k. Then let a and b be the vertices of degree 1 adjacent to uz and v2, respectively, 
let c be the vertex of degree 1 at greatest distance from u in the branch at u of T,, 
containing u;, and let e be the vertex of degree 1 adjacent to vu; (see Fig. 4, 
Step 1). In Fig. 4, 


n=49, soA=|log,49J=5 x =49-25=17=6(2) +5 


Let the number of additional vertices needed to produce T* be x =6y +1, 
where y, fe {0, 1, 2,...} and O<¢<S, If y>0, attach a string of y minimum 
order highly irregular 3-trees at vertex a so that vertex a has degree 2 (see Fig. 4, 
Step 2). If ¢=3, attach a path P, via a center vertex at d; if ¢=1 or 4, attach a 
pendant edge at b; and if ¢=2 or 5, attach pendant edges at both b and c (see 
Fig. 4, Step 3). The resulting tree is highly irregular and has n vertices. O 


In certain problems, for example in chemical applications of graph theory, one 
wants to restrict the value of A. If we limit A to 3, we can construct a highly 
irregular tree for n =8 and then for larger n by either subdividing at most two 
possible edges or adjoining an additional tree of order 6 containing two more 
degree 3 vertices (see Fig. 3). This process can be repeated and all highly 
irregular 3-trees can be obtained in this manner. Therefore, three consecutive 
orders for which highly irregular 3-trees exist are followed by three consecutive 
orders for which there is no highly irregular 3-tree, and visa versa. Thus, for 
infinitely many values of n, there is no highly irregular 3-tree of order n. When 
A =4, there are A-trees on n vertices for all n = 2°. In fact, such A-trees can be 
formed using exactly two vertices of degree A. 


Fig. 3. The highly irregular tree of order 14. 
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Fig. 4. Construction of a highly irregular tree of order 49. 


Theorem 2. There is a highly irregular 3-tree of order n if and only if n=2, 3, or 
4 (mod 6), where n=8. For A=4, a highly irregular A-tree of order n exists if 


and only ifn 2=2°. 


Note that the highly irregular 3-trees are all caterpillars. We denote the family 
of highly irregular A-graphs by @,. It is easily observed that @={K,}, 
G, ={K,}, and G = {P,}. We show in the next section that besides the 3-trees of 
Theorem 2, the family & of all highly irregular graphs with maximum degree 
3 contains only one other type of graph: highly irregular unicyclic graphs with 
girth 4x. 
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We now turn our attention to bipartite graphs that are not necessarily trees. 
Recall that the girth of a graph is the length of its smallest cycle. When A =3 or 
4, we would like to know the minimum order of a highly irregular bipartite graph 
with girth g = 2¢. A unicyclic graph is a connected graph containing exactly one 
cycle. The next result shows that a unicyclic highly irregular graph with maximum 
degree 3 is necessarily bipartite. 


Theorem 3. A unicyclic highly irregular graph with A=3 and girth g exists if and 
only if g =0 (mod 4). 


Proof. Suppose G is a unicyclic highly irregular graph with A = 3 and girth g, and 
let C be the cycle in G. Then each vertex u in C has degree 2 or 3, and one 
neighbor of v on C has degree 2 and the other has degree 3. Thus, the degree 3 
vertices on C come in consecutive pairs, as do the degree 2 vertices, and the pairs 
alternate as we proceed around the cycle. This implies that there is the same 
number x of degree 3 pairs on C as degree 2 pairs. Thus g = 4x, so g = 0 (mod 4). 

If g =0 (mod 4), we construct a unicyclic highly irregular graph with girth g and 
A=3 as follows: Let C =u,, v2,..., Ug, VU; and attach a pendant edge at each 
vertex Uy,,, and vy... O 


Since the neighbors of each vertex of degree 3 in Theorem 3 have degree 3 and 
degree 2 on C, its neighbor of degree 1 is not on C. The distribution of degree 3 
vertices on C then makes G uniquely determined. 


Corollary 1. For each positive integer g=0(mod 4), there is precisely one 
unicyclic highly irregular graph with A=3 and girth g, and it has order }(3g). 


Corollary 2. A highly irregular graph with A =3 that is not a tree is unicyclic. 


If G is a bipartite graph with k vertices in each part, then we call Ga k xk 
bipartite graph. 


Corollary 3. For all k =3, there is a highly irregular k x k bipartite graph with 
A=3. 


Proof. The unicyclic graphs with girth g =0 (mod 4) in Corollary 1 have order 
n = 3g/2. Since g =0 (mod 4), it follows that n =3g/2=0 (mod6). In Theorem 
2. we saw that there are highly irregular 3-trees with order n =2 or 4 (mod 6). In 
all cases, the resulting graph are k Xk bipartite. O 
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Corollary 4. Let G be a highly irregular k X k bipartite graph with A=3. If k =1 
or 2 (mod 3), then G is a unique tree. If k =0 (mod 3), then G is unique and 
unicyclic with a cycle of length 4k/3. 


The following remark is easy to prove. 


Remark 3. The minimum order of a highly irregular bipartite A-graph is 2 and 
the graph achieving the minimum is the unique highly irregular A x A bipartite 
A-graph. 

When A223, the graphs in Remark 3 all have girth 4. Thus, the minimum 
order of highly irregular bipartite A-graphs with girth 4 is 2A. 

We now restrict our attention to highly irregular bipartite 4-graphs. By Remark 
3, the minimum order of such a graph is 8. If, additionally, we require that G be a 
tree, Theorem A and Lemma 2 imply that the minimum order is 16. The graph of 
order 8 has girth 4 and the tree has infinite girth. Thus, we consider the 
intervening girths. Since G is bipartite, each cycle (and therefore the girth) of G 
is even. Let G be a bipartite graph with A =4 and girth g = 2¢, and let C be a 
cycle of length g in G. It is easy (but tedious) to show that if there is a vertex of 
degree 4 not on C, then the order of G is at least |4(3g)| + 6 when g > 6. 


Theorem 4. Let G be a highly irregular bipartite 4-graph with girth g<=. Then 
the minimum order of G is 


{° ifg=4, and 
get2lig]+6 ifg 26. 


Proof. Fig. 1 shows that the statement is true when g = 4, and the case for g =6 
can be easily checked. Thus, suppose g = 8. We first show that a minimum order 
graph is unicyclic. Suppose this is not the case, and let C, be a cycle in addition to 
a first cycle C, with g vertices. Then C, either shares some vertices with C, or is 
joined by some path to C, (Fig. 5). In either case, C, introduces at least 4g — 1 
additional vertices. Therefore, from C, and C, there are at least 4(3g) — 1 vertices 
and at most half of these have degree 2. Thus at least [4(3g — 2)] vertices on 
C, UC, have degree greater than 2. Since C, and C, meet in at most two vertices, 
at least [1(3g — 2)] — 2 vertices have neighbors not on C, UC. G has at least two 
vertices of degree 4. This produces at least six vertices not on C, or C2 (two of 
which have been counted already, thus an actual increase of 4). Thus the order of 
G is at least 


4(3g) — 1+ [4(3g —2)] -2+4. 


Since g = 8, [1(3g — 2)] =6, so the order of G is at least 4(3g) + 7 if there are two 
or more cycles. However, we can construct a unicyclic, highly irregular bipartite 
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Fig. 5. Some possibilities for C, and C). 


4-graph with girth g and order g + 2[4g| +6<4(3g) +7. Thus, the minimum 
order graphs are unicyclic. 

To construct a minimum graph when g2>6, begin with a cycle C: v,, 
UV2,...+, Ug, V;. Let the degrees in G be degu,=degu, =4. If g=0(mod 4), 
then deg v, =3 for k =2 or 3 (mod 4) and deg vu, =2 for k=0 or 1 (mod 4), 
1<k<g. If g=2 (mod 4), then deg v. = deg v,_, =3, deg uy, = 2 for k=0 or 3 
(mod 4), and deg v, =3 for k = 1 or 2 (mod 4), 3<k <g — 2 (see Fig. 6). Finally, 
in either case, attach a pendant edge at each vertex of degree 3 and attach a P, 
via one of its center vertices at v, and u,. The resulting highly irregular bipartite 
4-graph has girth g and order g + 2[4g/ +6. O 


We have characterized the graphs in the families @, G,, @, and %, and in 
Theorem 4 we determined the minimum order for bipartite graphs in %. It seems 
to be a difficult task to characterize graphs in G, for A>4. We can, however, 
determine the orders for which there is a graph in G,, A> 4. 


Fig. 6. 
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(a) (b) 
Fig. 7. 


Theorem 5. For any pair of integers n and A such that n= 2A = 8, there exists a 
graph of order n in G,. 


Proof. Suppose n = 24> 8. Let Hp denote the unique highly irregular bipartite 
A-graph with order 2A described in Remark 3. Let V = {v,, v2,...,U,} and 
U={u,, uz,..., Ug} be its parts, where deg u, = deg u, = k. Next, we describe a 
sequence (H,) of graphs in , such that the order of H, is 2A +¢, t=1,2,.... 
The graph H, is obtained from Hy by subdividing the edge v.u,_, with vertex v, 
(see Fig. 7a). The graph H, is formed from H,— vv, by adding vertex x 
together with the edge vx, (see Fig. 7b). The graph H; is formed from H, by 
adding vertex x, together with edge xx, (see Fig. 7c). 

The graph H, is obtained from H, by deleting u2 and adding the vertices wo, Zo 
and z,, together with the edges v,_,Wo, U,2, and 2Zpz,. In a similar manner, we 
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construct the graphs H;, H,, H>, Hg, Hy, which along with the graph H, are 
illustrated in Fig. 8. 

Since the set B = {4, 5, 6, 7, 8, 9} forms a complete residue system modulo 6, 
every integer n > 10 can be expressed in a unique way as n = 2A +j + 6k, for 
some j € B and k 1. Thus the graph H, is obtained by attaching a string of k 
minimum order highly irregular 3-trees at vertex z, in H;. Each such H, is highly 
irregular and has maximum degree A. UO 


It is easy to show that there is no bipartite graph in Y, of order 2A + 1, which 
gives the following. 


Corollary 5. For a given positive integer A>4, there exists a bipartite graph of 
order n in G, if and only ifn =2A orn22A +2. 


4. m-Chromatic graphs 


Corollary 5 has the following generalization kindly communicated to us by 
Chartrand. 


Theorem 6. Let G be a highly irregular m-chromatic A-graph, m23. Then 
A2=2m — 2. 


Proof. As noted earlier, all graphs in 4, 0<k <3, are bipartite, so A>4. If 
m=3, then A=4=2m — 2. 

Suppose that m>4. Let H be a critically m-chromatic subgraph of G. If 
H=K,,, then each vertex of H is adjacent to m — 1 vertices of H (having distinct 
degrees in G), so that H contains a vertex u with degg u = 2m — 2, which implies 
that A=2m — 2. 

Finally, suppose that H # K,,. Since m = 4, H is not an odd cycle (which is only 
3-chromatic). By a result of Brooks [3], H contains a vertex w of degree at least 
m. Since 6(H) =m — 1, w is adjacent to a vertex of degree (in G) at least 2m — 2 
so that A=2m-2. O 


Theorem 7. For given integers m (23) and A, with A=2m—2, there exists an 
integer f(m, A) such that whenever n = f(m, A) there is an m-chromatic graph G 
in G, having order n. 


Outline of proof. Begin with the graph Hy with vertex set 


V(Ab) = {@,,@, Ler ,4m—1,9,, b2,. . «5 Omi=1s C15 €25 = ere »>€m—1,€1, €2,- oe » m1} 


Highly irregular m-chromatic graphs 13 


and edge set 
E(Ho) = {bjby, ¢)€,, 5)¢;:f FA} {bjay, cjey jf = kK}. 


This graph is a highly irregular, m-chromatic graph of order 4m—4 with 
A=2m —2. For this fixed value of A, the graphs for values of n >4m-—4 are 
obtained using operations analogous to those used in the proof of Theorem 5. By 
using similar techniques with slightly more complicated constructions, we can 
obtain the graphs with A>2m—2. O 
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For every € >0 and every integer m >0, we construct explicitly graphs with O(m/e) vertices 
and maximum degree O(1/€7), such that after removing any (I — €) portion of their vertices or 
edges, the remaining graph still contains a path of length m. This settles a problem of 
Rosenberg, which was motivated by the study of fault torerant linear arrays. 


1. Introduction 


What is the minimum possible number of vertices and edges of a graph G, such 
that even after removing all but € portion of its vertices or edges, the remaining 
graph still contains a path of length m? This problem arises naturally in the study 
of fault tolerant linear arrays, (see [18]). The vertices of G represent processing 
elements and its edges correspond to communication links between these 
processors. If p, 0<p <1 is the failure rate of the processors, it is desirable that 
after deleting any p portion of the vertices of G, the remaining part still contains 
a (simple) path (= linear array) of length m. Similarly, if u, 0< <1 denotes the 
failure rate of the communication links, it is required that after deleting any pu 
portion of the edges of G the remaining part still contains a relatively long path. 
The objective is to construct such graphs G with a small number of vertices and 
edges, since these will give rise to efficient networks. Some variants of this 
problem are discussed in [14, 9, 19]. In this note we prove the following result. 


Theorem 1.1. For every € >0 and every integer m>1 there is a graph G, which 
can be explicitly constructed, with O(m/e) vertices and maximum degree O(1/€’), 
such that even after deleting all but €-portion of its vertices or all but €-portion of 
its edges, the remaining graph still contains a path of length m. 


This settles the problem raised by Rosenberg [18], and is also related to the 
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study of size Ramsey numbers (see [10, 4]). We note that the edge version of the 
above theorem for € = 3 was proved by Beck [4], and his proof can be modified 
to give the general case, without explicit construction. Our main contribution 
here is to obtain an explicit construction by combining the arguments of Beck [4] 
with some of the eigenvalues technique of [1-3] and the recent construction of 
expanders given by Lubotzky et al. [16]. 


2. The proof of the main result 


For a graph H =(V, E) and Sc V, let N,,(S) denote the set of all neighbors in 
H of vertices of S. Pésa [17] proved the following useful lemma which provides a 
relation between the expanding properties of a graph and the size of the longest 
path it contains. (See also [15] for a simpler proof.) 


Lemma 2.1. Let H =(V, E) be a nonempty graph. If |Ni(S)— S| =2|S|—1 for 
every vertex subset S < V of cardinality |S|<k, then H contains a path of length 
3k — 2. 


The following simple lemma is an old folklore result (see, e.g. [4]): 


Lemma 2.2. Any graph G on n vertices with averge degree d contains an induced 
subgraph H such that for every vertex-set S of H the number of edges incident to 
vertices in S is at least d |S|/2. 


Next we need a relation between the eigenvalues of a graph G and the density 
of its induced subgraphs. Let A=Ag be the n by n adjacency matrix of a 
d-regular graph G=(V,E) on V={1,2,...,n}. Clearly d is the largest 
eigenvalue of A and its eigenvector is the all 1 vector. Suppose that the absolute 
value of any other eigenvalue of A is at most A. For Sc V let e(S, S) denote the 
number of edges of G between vertices of S and vertices of § = V —S, and let 
e(S) denote the number of edges that join two vertices of S. 


Lemma 2,3. In the above notation, for every subset S c V of cardinality |S|= an 
|e(S) — 4da*n| < 4Aa(1—a@)- xn. (2.1) 
We note that the term 4da’n is roughly the expected number of edges in an 


induced subgraph of G of size a-n. Thus, for small A, every such induced 
subgraph has about the same number of edges. 


Proof. Define a vector f:V—R by f(i) = —1/|S| if ie S and f(é) = 1/(n — [S]) if 
i¢S. Since U7, f(i) =0, i.e. f if orthogonal to the eigenvector of the largest 
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eigenvalue of A, we conclude that |(Af, f)| <A f), where (, ) is the usual scalar 
vroduct. One can easily check that 


(Af, f) =2 ZO: -fG)= a> Po- > (F@ -fW)Y. 


eke 


For the specific f defined above 


n as 1 a. om 1 2 
LPO= Gras MM LUO-sMyr=es (+5 ace 


a 1 1 1 1 
(8, \(F+— 5) -a(F+— =) <a(S +), 
© Nis nisi) ~ Aisi ta is/| SNist * 15) 


which implies, by substituting |S| = an, that 

|e(S, 8) -— da(1 — a)n| SAa(1—-a@)-n. (2.2) 
Since G is a d-regular graph, 

2e(S) + e(S, §) =d |S| = dan, 


Thus 


e(S) =4dan — 4e(S, S). 
This and inequality (2.2) imply inequality (2.1). This completes the proof. O 


Proof of Theorem 1.1. Lubotzky et al. [16] showed that if p and q are primes 
congruent to 1 mod 4, with p a quadratic non-residue mod q, then there is an 
explicitly constructed d=p +1 regular graph G with n =q(q*-1)/2 vertices, 
such that the absolute value of each of its eigenvalues but the first is at most 
A=2Vd-—1. We next show that for properly chosen p and q, G satisfies the 
assertion of Theorem 1.1. We first consider the case of deleting vertices. Suppose 
we delete all but a set V of €-n vertices of G. By Lemma 2.3 the induced 
subgraph of G on V contains at least 4de7n — 4A €(1 — €)n edges, i.e. has average 
degree at least ed—A(1—€). By Lemma 2.2 this graph contains an induced 
subgraph H in which every vertex set of cardinality x hits at least 4(ed — A(1 — 
€))-x see Let S be an arbitrary vertex subset of H, of cardinality x = Bn < an, 
where a < e will be chosen later. We next show that for a properly chosen d: 


INu(S) — S| > 2 |S]. (2.3) 


Indeed, otherwise, if T= N,(S) —S, then |S U T|<3x and there are in H (and 
hence in G) at least 4(ed—A(1—€)B)-n edges joining vertices of SUT. 
However, by Lemma 2.3 


e(SU T) <4d9B’n + 4A - 3B(1 — 3B) -n 
and therefore the inequality 
ed — A(1— €) = 9Bd + 3A(1 — 3B) 
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must hold. Since A < 2V/d — 1 this implies 
—2Vd — 1(4— € — 9B) 


d< (2.4 
(98) 
Hence if we choose d such that 
4—€-9a\’ 
d>4- (——*) 
€-9a (2.5) 


then (2.4) is violated for all 8 <a@ and hence (2.3) holds. By Lemma 2.1 we 
conclude that H contains a path of length 3|an] — 2. Therefore if we choose e.g. 
a = ye and we choose the primes p and q in the construction of G such that 


2 2 
d=p+1>4(*) Ande Aes Gna) (2.6) 
€ 2 € 
we conclude that even if we delete all but € - n vertices of G, the remaining part 
still contains a path of length m. This completes the proof for the case of deleting 
vertices. The case of deleting edges is somewhat simpler. Indeed, if we delete all 
but an €-portion of the edges we are left with a graph of average degree e - d. 
This graph contains, by Lemma 2.2, an induced subgraph H in which any set of x 
vertices hits at least edix edges. Hence if S is any vertex subset of cardinality 


9a\? 
) one can check, as 
€—-—9a 


before, that |N;,(S) — S| >2|S|. Thus, by Lemma 2.1, H contains a path of length 
3|an| —2. It is easy to check that the previous choice of p, q given in (2.6) 
suffices to guarantee a path of length m in this case, as well. (In fact, here a 
slightly smaller d is enough.) By the standard results about the distribution of 
primes (see e.g. [8]), there is a choice for p and q for which (2.6), as well as the 
estimates n = O(m/e) and d = O(1/e’) hold. This completes the proof. O 


Bn <n of H, where a <ie, and if d satisfies d > 4( 


3. Related problems 


3.1. Size Ramsey numbers 


The size Ramsey number A,(G) of a graph G is the least number of edges in a 
graph H with the property that any two coloring of the edges of H contains a 
monochromatic copy of G. Size Ramsey numbers were first considered in [10], 
and several results on them can be found also in (6, 12, 13]. Beck’s result [4], 
mentioned in Section 1, resolves the problem raised by Erdés of estimating the 
size Ramsey numbers for paths. Beck’s construction, however, is probabilistic. 
Typically, explicit constructions are much more difficult to find than random ones 
for Ramsey type problems (see [13]). Our construction supplies an explicit 
example showing that the size Ramsey number for paths is linear. 
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3.2. Fault tolerant graphs for bounded degree trees 


A natural extension of Theorem 1.1 is obtained by replacing the requirement 
for paths of length m by a requirement for all trees of maximum degree k and size 
m. Beck [4] proved, without an explicit construction, that there exists a graph G 
with O(k -m- (logm)'*) edges, such that any set of half of its edges contains 
every tree of size m and maximum degree k. Very recently, Friedman and 
Pippenger [11] gave, for every € >0 and m, k =2, an explicit construction of a 
graph G with O(mk?/e) vertices and maximum degree O(k?/e€”) such that any set 
of an €-portion of its edges contains every tree of size m and maximum degree k. 
Their construction is based on an interesting generalization of Pésa’s theorem 
(Lemma 2.1) from paths to trees. Some other related results can be found in 
(5, 7]. 
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The main result of the paper is that if n is sufficiently large and 2<r=r(n) < log log logn 
then nm objects can be sorted in  r rounds by asking no more _ than 
2't nl (log n)?-™"/ (log log n)—!™" questions in each round. 


Suppose that we are given n elements in some total order unknown to us. We 
wish to identify this total order by asking questions in r rounds: we aks m, 
questions in the first round, then process the information obtained and in the 
second round we choose an appropriate set of m, questions, and so on, in the rth 
round was ask m, questions such that the answers to our questions tell us the 
order of the elements. A question is just a binary comparison: how are a and b 
related? The answer tells us that either a<b or b<a. Let us write SORT(n, r) 
for the minimal value of m for which we can always distribute our questions in 
such a way that m;<m for i=1,2,...,7r. Thus SORT(n, r) is the minimal 
number of parallel processors that can sort n elements in time r. 

In studying SORT(n, r) the nature of the difficulties depends on the relation- 
ship between n and r: whether r is large, say at least log n, or small, say less than 
log r, perhaps even bounded. Ajtai, et al. (2, 3] proved that SORT(n, |logn])< 
cn for an absolute constant c. Therefore, if r=logn then SORT(n, r)< 
2cn(log n)/r. Here we shall be concerned with the case in which r is small. This 
case has been studied by Haggkvist and Hell [12-14], Bollobés and Rosenfeld 
{10}, Bollobés and Thomason [11], Alon [4], Alon, Azar and Vishkin [5] and 
Pippenger [18]. (For a review of these results, see [9] and [6, Ch. XV]. Related 
results can be found in [1, 7 and 8].) In particular, Haggkvist and Hell [13] 
proved that SORT(n, r) = Q(n'*!”) for every fixed r. This was strengthened by 
Alon et al. [5] who showed that SORT(n, r) = Q(rn'*!”) for r=r(n) <logn and 
SORT(n, r) = Q(n'**""(log n)'”) for every fixed r. 

Concerning upper bounds, Bollobdés and Thomason [11] proved SORT(n, 2) < 
41n}3 log n if n is sufficiently large. Alon et al. [5] improved this to SORT(n, 2) = 
O(n? log n/(log log n)!). More importantly, Pippenger [18] proved that 
SORT(n, r) = 0(n'*"”"(log n)?~*”) for every fixed r. Even more, Pippenger used 
concrete sorting networks to sort the elements: he showed that the so-called 
Ramanujan graphs constructed recently by Lubotzky, Phillips and Sarnak [16-18] 
can be used as efficient sorting networks. 
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In this paper we shall give an upper bound for SORT(n, r) in the case when r is 
constant or rather small compared to n. Similarly to the proof of most of the 
upper bounds, we shall use random graphs. (For an introduction to random 
graphs, see [6].) A weaker form of the main result of this paper was announced at 
the Conference on Combinatorics, Graph theory and Computing in Boca Raton 
in 1982. (The result was slightly misquoted in [9].) 

Let V be the set of n elements we wish to sort. The questions asked in the first 
round form a graph G with vertex set V. The answers we get give us an acyclic 
orientation G of G and they tell us that the total order we are searching for is an 
extension of the partial order on V determined by G. Our aim is to show that 
there is a graph G whose size e(G) is rather small, which is such that in every 
acyclic orientation G of G the vertex set can be covered by sets V,, Vo,..., V, of 
size at most ¢, where both s and ¢ are small, such that any two elements 
incomparable in the partial order determined by G belong to some V;,. Hence if in 
the remaining r — 1 rounds we sort each V, then V itself will be sorted. Therefore 
SORT(n, r) < max{e(G), s SORT(t, r—1)}. We shall show that an appropriate 
random graph will do for G. First we introduce some graphs that will turn out to 
be suitable for sorting. 

Let ag=1<a,<---<a,<a and 1<d,<d,_,<---<d,. Let G be a graph 
with vertex set V = V(G). We call G an (a; a, d); a2, d2;...34a,, d,)-graph if 

(i) for all V;c V, [V,| 2a, there exists a set W, c U,, |[W,| =|V\V;| — a such that 

if X;, c V; and [X;| <a; then |I'(X;) N V;| = d; |X; 

(ii) for all A, c V, |A,|2a,, we have |V\(I'(A,) UA,)| Sa. 

Note that both (i) and (ii) express the fact that G is ‘spreading’. Condition (i) 
says that a given large subset V; of vertices, all not-too-large subsets in a large 
subset of V\V,; have many neighbours in the second. Condition (ii) is even 
simpler: the complement of G does not contain a K(a, a,), a complete bipartite 
graph with a vertices in one class and a, in the second. 

Let us see what happens if we use an (a;4,,d,;...;34;, d,)-graph to sort n 
elements. 


Lemma 1. Let G be an (a;a,,d,;...;a,,d,)-graph, let G be an acyclic 
orientation of G and set 


u=| >, log(a;/a;_1)/log d,| +1, 
é=1 


where ag=1. Then in G all but at most h+2ua vertices dominate at least h 
vertices. 


Proof. Take a complete order on V = V(G) consistent with G. Let T be the set 
of the top n —(h + (2u — 1)a) vertices, A, the next 2a vertices, A, the next 2a 
vertices, etc., up to A,_,, and then let B be the bottom A + a vertices of V. Thus 
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TUA,UA,U---UA,_,UB is a partition of V. For simplicity, we also write 
Ag = T and A, = B. 

Let u, be the minimal integer satisfying d{!>a,, let u, be the minimal integer 
satisfying d{'d3? >a», etc., up to u,. Then Yj_ju;<u-—1. For 1<j<u-1 let 
i(j) be the integer defined by 


i()-1 i(j 
5 wie > wu 
i=1 


i=1 


Thus in the sequence d,,d,,..., 4, d,d,,...,d,,d;,d;,..., with d, 
repeated u; times, the jth term is d;,). 
Define sets A;c Aj, |A|2=|Ajl—@, j=u—1,u—2,...,0, as follows. Set 


A,-1 = Au-1- Having defined A,_1, Ay-2,..., Aj, 721, set i=i(j) and V,=A,. 
Let W, c V\V,, |Wil=|IV\V,| — a =a, be the set whose existence is guaranteed by 
condition (i), and set A,_; = W;N U;. 

By construction, |Ao| =|T| —a =n — 2ua —h, so to prove the lemma it suffices 
to show that every vertex in Ay dominates at least h vertices in the partial order 
given by G. Let x € Ao and let X; be the set of vertices in A; dominated by x. 
Then, by the definition of the sets A, we have 


|X| =d,, |X| =d), eaey Xu, 2dp >a, caey Xu ,+ul 
Sdid’>a,,..., |X,.|2did%,...,d">a,. 


Finally, by property (ii), X,-; dominates all but at most a vertices in B = Au; in 
fact, all but at most a@ vertices in B are joined to at least d vertices in X,,_,. Hence 
x itself dominates all but at most a vertices in B so it dominates at least 
|B|—a=h vertices. O 


Before applying Lemma 1 to prove the inequality mentioned in the introduc- 
tion, connecting sorting in r rounds to sorting in r—1 rounds, let us introduce 
some more terminology. Given an acyclic orientation Gofa graph G with vertex 
set V, for x € V let 0,(x) be the minimal order of x in an extension of G to a 
complete order and let o*(x) be the maximal order of x. Note that x dominates 
precisely 0,(x) — 1 elements in G and is dominated by n — o*(x) elements, where 
n=|V|. In particular, o,(x) =1 iff x is a minimal element of the partial order 
determined by G and o*(x) =n iff x is a maximal element. Lemma 1 claims that 
{x €V:0,(x) <h}|<h + 2ua and, by symmetry, |{x «EV :0*(x) >h}|<n-hAt+ 
2ua. 


Lemma 2. Suppose there is an (a; a,,d,,...,4a,, d,)-graph G of order n. Set 


“= [> log(a,/a,_,)/log a,| +1 
i=1 
and let v =2 ua, where ag=1. Then 


SORT(n, r) < max{e(G), n/2u SORT([4v], r— 1}. 
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Proof. Let us use G to sort in the first round; say G is the acyclically oriented 
graph given by the answers to our questions. Set j = |n/2u], ho=0, hj =n, and 
for 1<i<j—1 define h,=(2i+1)v. Note that h;_,+vu>n. Furthermore, for 
1 <i <j define 


V, = {x € V:0,(x) <A, and o*(x) >h,_1}. 
We claim that 
|Vi| = 4u (1) 


for all i, 1<i<j. To see (1), define 


B, = {xe V:0,(x) <h;} 
and ; 
T, = {x € V:0*(x) >h,_4}. 


Then, by Lemma 1, 
|B|<h,+u and |7,|\<a—h,_, +0. 


Since B; U T; = V and V;= B,NT;, for 2<i <j — 1 we have |V,| <2u +h; -—h,_, = 
4u. Also, V, = B, so |V,| =|Bil| <A, + v = 4u and V, = 7; so [Vl =|7F| <1 —hy_, + 
v <4v, completing the proof of (1). 

We claim also that if x and y are incomparable vertices in V then there is an 
index i, 1<i<j, such that x and y both belong to V,. Indeed, as x and y are 
incomparable, there is an extension of G to a complete order in which x is the 
predecessor of y; say, x is the Ath element and y is the (h + 1)st. If we 
interchange x and y then we get a complete order extending G in which x is the 
(h+1)st element and y is the Ath. In particular, 0,(x)<h<o*(x) and 
0,(y)<h<o*(y). Let 1<i<j be such that h;_.}<h<h;. Then o,(x)<h,, 
o*(x) >hj_,, 0,(¥) <h; and o*(y) >h;_; so x, y € V;. 

Having used G to sort our elements in the first round, to complete the sorting 
in r— 1 rounds, it suffices to sort each set V; in ry —1 rounds. That can be done 
with 


j 
> SORT(|V;|, r— 1) <j SORT(4v, r-1) 
i=1 


questions a round, implying the assertion of our lemma. O 


Our next aim (Theorem 4) is to show that in a suitable space of random graphs, 
we can find an appropriate (a; a,, d,;@2, d2)-graph. This result can be proved in 
many ways; a part of the assertion will be deduced from the following lemma of 
Pippenger [18], which is essentially from [3]. For the sake of completeness we 
include its short and simple proof. 
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Lemma 3. Let G be a graph whose complement does not contain a K(b +1, b + 
1), and let Vs <c V = V(G), |Vol =55. Then there is a set BV, |B| <b, such that 
if X <V\(VoU B) and |X| <b then 


[F(X) N Vol = 2X1. 


Proof. Let B= {Bc V\Vo:|B| <b, |'(B)N Vol <2|B| — 1}. If B= then B=6 
will do. Otherwise let B be a maximal element of B. Since no edge joint B to 
Vo\I'(B) and Vo\|I'(B)| < 56 — (4b — 1) =b +1, we have |B| <b. Also, if Xc 
V\(VoU B) and 1<|X|<5b then X UB €Bso 


|F(BUX)NV| =2|BUX|=2 [Bl] +2 |X|, 
implying 
IT(XXYN Vy =2|X|4+1. 0 


Theorem 4. There is an no such that the following assertion holds. Let 
n2=no, 2<r=r(n)<logloglogn, a=2'"', w= (logn)/loglogn, m= 
n'—""(log n)™"~\(log logn)""", a =(3/a)m=3.2-'m, ay=2-"m, dy = 
(log n)i, ay=a/5 and d,=2. Then there is an (a; a,, dy; a, d»)-graph of order n 
and size at most a(log n)n?/m. 


Proof. Set p =2an"’ (log n)?~*"(log log n)~°~'”"/n = 2a(log n)/m and consider 
the space G(n, p) of random graphs. To prove the theorem, it suffices to show 
that if n is sufficiently large, with the threshold not depending on r=r(n), then 
the probability that G, is an (a; a;, d;)-graph is at least 2 for i=1, 2. Indeed, as 
the probability that the size of G, is at most a(log n)n?/m = pn’/2 tends to 3, this 
implies that if n is large enough then the probability that G, will do the for the 
graph in our theorem is at least 4, say. In fact, we shall show that as n>», 
almost every G, is an (a; a,, d,; a2, d>)-graph. 

Let us see first that a.e. G, is an (a; a2, d,)-graph. By Lemma 3 is suffices to 


show that almost no G, contains a K(b+1,5+1), where b= la2|. Let 
X = S(G,) be the number of K(b + 1, b +1) graphs in G,. Then 


£O0= 3, baa Joes (ZY em 


1)/r 


Since Sa < (log log n)”~” if n is large enough, (en/a2)? <n if n is large enough. 


Also, paz = log n, so 


E(X)< now < non 


if n is sufficiently large. 
Let us turn to the proof of the fact that a.e. G, is an (a;a,, d,)-graph. Set 
k = flogn| and write Py for the probability that a fixed vertex x is joined to at 
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most k vertices in a fixed set of @ = |a] vertices. Since k < }ap, 
k 
Py< (7 or —p)* < 2(“2) eno TKR 3(6e) 087 1e~Stogn < no 


if n is sufficiently large. 

Let 2, denote the event that G, contains two disjoint sets of @ vertices each, 
say A, and A), such that every vertex in A, is joined to at most k vertices in Ap. 
Clearly, 


nean(")("5")P 


if n is large enough. 

We shall show that, on the other hand, the probability of the event 92, that 
there are sets X,, X,< V(G,) such that |X,|<S = |a,], |X2|<d,|X.| and every 
vertex in X, is joined to at least k vertices in X, is also rather small. Since the 
subgraph spanned by X,U X, has at least k |X,| edges, we have the following 
rather crude estimate; 


reas (Nee 


t 


IN 
rer— 
ia) 
a|$ 
SS 
N 
= 
i] 
ww 
——e 
A 
= 
1 
nN 


(2) 


where t = |sd,] = |s(log n)3]. Hence 


en\*" /ed2a, p\* leu 
(a) <2(2) (=e?) 
(2) <> Fi 
Note that 
k/(edja, p) 22 gun 
(log n)? 
and 
3d\a, log n< uk |a,] 
10 
so 
P(2,)< Zee >? (3) 


if n is large enough. 

By inequalities (2) and (3) we have P(Q,U Q,) <2n~*. Therefore the proof of 
the theorem will be complete if we show that if G, is such that neither 2, nor Q, 
holds (i.e. G, ¢ 92,U 2.) then G, is an (a;a,, d,)-graph. Suppose then that 
G, € 2, U 9, and V, c V has at least a vertices. Let W, = {x € V\V:x is joined to 
at least a vertices in V,}. Then |W,|2=|V\V,|—a@ since, 92, holds. Suppose 
X,cW, and |X,|<a,. Let X,=I(X,)NV,. Then, since 92, holds, [X,| > 
d,|X,|. O 


We are ready to prove the main result of the paper. 


Sorting in rounds 27 


Theorem 5. There is an n, such that if n =n, and 2<r=r(n) <loglog log n then 


SORT(n, r) <f(n, 7), 
where 
f(n, r) on 27*1n1*""(log n)*- "(log logn)~°~" 


Proof. Let n=1no, a, M, r, a, d,, a2, d, be as in Theorem 4 and let G be the 
(a; a, d,; a, d>)-graph of order n and size at most f(n,r) whose existence is 
guaranteed by Theorem 4. Set 


u = flog a,/log d, + log(a2/a,)/log d2| +1, 
as in Lemma 2. Then a,/a, < 2"~? so 
WS cee 4 1S oe beg tH <2 jcoee 
This implies that 
uv = [4(log n/log log n)a| = [3.2~’*'n'~""(log n)*” (log log n)~"”"| 
can be shown in is Lemma 2, so 
SORT(n, r) < max f(n, r), 5 SORT(4v, r— nt. 
Consequently, 
SORT(n, r) <f(n, r) 
if 
5 SORT(4u, r—1)<f(n, 7). 
Since SORT(4u, a) = (4 < 8v? and 
4nu <6ni(log n)(log log n)~! <f(n, 2) = 8n(log n) (log log n)~}, 


this implies that if n =no then SORT(n, 2) <f(n, 2). 

We shall prove by induction on r that if 2<r<log log logn and n'” > ng then 
SORT(n, r) <f(n, r). This implies the theorem since it gives that if n > 79°, say, 
then SORT(n, r)<f(n, r) for all r, 2<r<logloglogn. As the case r=2 has 
been proved, let us assume that 3<r7<loglogn and the assertion has been 
proved for smaller values of n. Note that v >n°~'” so v““-) > ng. Hence, by 
the induction hypothesis, SORT(4u, r — 1) <f(4v, r— 1). Therefore 


SORT(4u, r— 1) v2’ (4v)'*""" Dog 4v)?-2"-D(log log 4p) 0-2-0) 


<Vt yn gVe-Dy “—D(Jog nj 4—-Dilog log ny Co e-y) 
< 2 'n'*!"(log n)- "(log log nye =f(n, r), 


proving the induction step. O 
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In conclusion, let us note that the networks we use are essentially independent 
of the answers we get, i.e. they hardly depend on the order of the elements. To 
be precise, the sorting algorithm implied by the proof of Theorem 5 makes use of 
r graphs, say G,, G2,...,G,, depending only on n and r, such that the first 
round questions are asked according to G,, then V is covered by some sets 
V,, Vo,..., V, of order |G,| and in the next round each V; is sorted by G2. Then 
each V;, is covered by sets Vj, Vi2,..., Vie of order |G3| and each V;,; is sorted by 
G;, etc. In the rth round we compare all pairs of elements belonging to some set 
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Let G be a group generated by X. A Cayley graph over G is defined as a graph G(X) whose 
vertex set is G and whose edge set consists of all unordered pairs [a, b] with a, b eG and 
a'be XUX™', where X~' denotes the set {x~'| x € X}. When X is a minimal generating set 
or each element of X is of even order, it can be shown that G(X) is Hamiltonian iff it is 
edge-Hamiltonian. Hence every Cayley graph of order a power of 2 is edge-Hamiltonian. 


1. Introduction 


Throughout this paper, we shall only consider finite undirected simple graphs. 
For each such graph I, we shall denote the vertex set of IT by V(I-) and the edge 
set by E(I). The letter G will always denote a finite group generated by a set X 
where the identity « of G is not in X. A Cayley graph over G is defined as a graph 
G(X) whose vertex set is G and whose edge set consists of all unordered pairs 
[a, b] with a,b€G and a"'be XUX™! (X7'= {x7 |x © X}). In other words, 
two vertices a, b of V(G) are adjacent iff ax = b or ax~'=b for some x in X. We 
shall call the corresponding edge [a, b] an x-edge and a, b the ends of [a, 5]. It is 
easy to see that each element a of G gives rise to an automorphism 9, of G(X) 
where @,:x— ax. Further, the mapping a— ©, is an embedding of G into the 
automorphism group of G(X). Because of this, it is clear that the graph G(X) is 
vertex-transitive in the sense that for any two vertices a, b of G(X), there exists 
an automorphism (namely, @, where c = ba~') of G(X) which maps a to b. In 
fact, the converse is almost true, for there are only four known examples of 
connected vertex-transitive graphs which are not Cayley graphs. 

Among the many conjectures which intngue combinatorists nowadays, the 
following conjecture of Lovasz is undoubtedly one that attracts much attention. 


Conjecture 1. [Lovasz] Every connected vertex-transitive graph has a Hamilton 
path. 


As Cayley graphs form a special class of vertex-transitive graphs, the above 
conjecture naturally leads to the following more specific conjecture: 


Conjecture 2. Every Cayley graph has a Hamilton cycle. 
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Though much effort has been made in order to verify the conjecture, yet up to 
date, it is known that the conjecture is only true for Cayley graphs over some 
very special classes of groups. For instance, it is fairly easy to show that Cayley 
graphs over Abelian groups are always Hamiltonian. However, for Cayley graphs 
over non-Abelian groups, the conjecture is far from being solved. Chen and 
Quimpo prove in [2] that Cayley graphs over groups of order pq, where p, q are 
primes, are Hamiltonian and in {3] that Cayley graphs over Hamiltonian groups 
(ie. non Abelian groups in which every subgroup is normal) are Hamiltonian. 
The best known result so far is perhaps the one proved by Keating and Witte in 
[4], which states that if G is a group whose commutator subgroup is a cyclic group 
of prime power order, then any Cayley graph over G is Hamiltonian. Witte [5] 
also showed recently that every Cayley graph of prime power order is Hamil- 
tonian. Furthermore, for most of these known results, it can be shown that the 
corresponding Hamiltonian Cayley graphs are in fact edge-Hamiltonian, in the 
sense that every edge lies on a Hamilton cycle. This in turn leads to another 
interesting conjecture: 


Conjecture 3. Every Hamiltonian Cayley graph is edge-Hamiltonian. 


The main objective of this paper is to give a partial solution to this conjecture. 
Among others, we shall show that when each element of X is of even order, 
G(X) is Hamiltonian iff it is edge-Hamiltonian. Hence Cayley graph over a group 
of order a power of 2 is always edge-Hamiltonian. 


2. Basic lemmas 
The following lemmas will be useful in the sequel. 


Lemma 1. A Cayley graph G(X) is edge-Hamiltonian iff for each x in X, G(X) 
contains a Hamilton cycle containing an x-edge. 


Proof. Apparently if G(X) is Hamiltonian, then the given condition holds. 
Conversely, assume that for each x in X, G(X) contains a Hamilton cycle with an 
x-edge. To prove that G(X) is edge-Hamiltonian, let e = {a, b] be any edge of 
G(X) with a~'b =x eX UX7'. Without loss of generality, we may assume that 
x eX. By the given condition, G has a Hamilton cycle C containing an edge 
e’=[c, d] with c~'d =x. Now consider the mapping ©,:y— gy where g =ac™’. 
Then ©, is an automorphism of G(X) and so ©,(C) is also a Hamilton cycle of 
G(X). However, @,(c) = gc =a@ and @,(d) = gd = gcc™'d = gcx =ac~'ca~'b = b. 
Hence the edge {a, b] lies on the Hamilton cycle ©,(C) which proves that G(X) 
is edge-Hamiltonian, as required. O 
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Lemma 2. Let C be a Hamilton cycle of a Cayley graph G(X). Let Y= 
{a~'b | {a, b] € E(C)}. Then Y generates G. 


Proof. Let the Hamilton cycle C be (uo, v;,..., U,), where n is the order of G 
and vp=v,, =1, the identity of G. Let uv; 'u;,,;=x; where i=0,1,...,n—-1. 
Then the set Y’ = {x;|i=0,1,...,—1} is contained by Y. Moreover, as each 
uv; is the product of x9, x,,...,X;-1, we see that Y’ generates G and so Y also 
generates G, as required. O 


Let I be a cubic graph. By a perfect matching partition (PMP) of , we mean 
an unordered triple P=[P,, P;, P;] where the P’s are edge-disjoint perfect 
matchings of I’. Apparently, they form a partition of E(I) into perfect 
matchings. Moreover, by an even cycle partition (ECP) of [, we mean a set 
C={C,..., Cy} of vertex-disjoint cycles of F whose union is V(I). Hence C is 
a partition of V(I) into even cycles. It is easy to see that each (PMP) 
P=[P,, Ps, Ps] gives rise to exactly three (ECP)’s, namely, C;=P,U Py, C,= 
P,U P, and C;=P;UP,. On the other hand, for each (ECP) C={C,..., Cy} 
there are exactly 2‘~' (PMP)’s which give rise to C, because each even cycle has 
two perfect matchings and the set of all edges not in C,,...,C, also form a 
perfect matching of IF. With this in mind, we can now establish the following: 


Lemma 3. Every edge in a cubic graph lies on an even number of Hamilton 
cycles. 


Proof. Let I be any cubic graph and e an edge of I. Consider the following table 
each row of which corresponds to a (PMP) and each column of which corresponds 
to an (ECP). As each (PMP) P=[P,, P:, Ps] gives rise to three (ECP)’s 
C, =P, UP, C, =P, UP; and C;= P;U P,, we shall complete the table by filling 
in the three (ECP)’s P,UP,, P,UP;, P,UP, in the row corresponding to 
P =[P,, P,, Ps] and in the columns corresponding to C,, C2, C3 respectively. All 
other entries in this row will be filled up by empty sets. 


Now we shall count the number N of occurrences of the edge e in the whole 
table. Evidently, each row contains the edge e exactly twice. Hence, N=2r 
where r is the number of rows, or the number of (PMP)’s of F. On the other 
hand, if we count over a column corresponding to an (ECP) C= {C,..., Cx}, 
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as this (ECP) occurs at exactly 2“~' locations in the column, the number of 
occurrences of e in the column is either 0 or 2“~' which is always even for k >1 
and is equal to 1 for k=1, in which case C consists of exactly one Hamilton 
cycle. Hence the number N of occurrences of e in the table is equal to the sum of 
the number n, of Hamilton cycles containing e and an even number n). That is, 
N=n,+n,=2r. Therefore n, must be even, as required. O 


Lemma 4. Every Hamiltonian cubic graph contains at least 3 Hamilton cycles. 


Proof. Let I be a Hamiltonian cubic graph. By Lemma 3, G contains at least two 
Hamilton cycles C, and C2, say. There must exist an edge e of I which is in C, 
but not in C,. Again, by Lemma 3, there is a Hamilton cycle C; other than C, 
that also contains e. Evidently C; is different from C2 and so we have at least 
three Hamilton cycles C,, C2 and C;in lr. O 

To end this section, we would like to raise the following question: 


Problem. Does every regular Hamiltonian graph other than a cycle contains 
more than one Hamilton cycles? In particular, does every 4-regular Hamiltonian 
graph contain more than one Hamilton cycles? 


3. Edge Hamiltonian property of Cayley graphs 


With the basic lemmas established in the previous section, we are now in a 
position to prove the following theorems. 


Theorem 5. Let X be a minimal generating set of the group G. Then G(X) is 
Hamiltonian iff it is edge-Hamiltonian. 


Proof. The sufficiency is clear. To prove the necessity, assume that G(X) is 
Hamiltonian. Let C be any Hamilton cycle of G. It follows from Lemma 2 that 
the set A = {a~'b | [a, b] € E(C)} generates G. As this is a subset of XU X~', by 
minimality of X, we must have X c A. Hence for each x in X, there exists an 
edge {a, b] of C with a~'b =x. Thus C contains an x-edge. Hence by Lemma 1, 
G(X) is edge-Hamiltonian. O 


Theorem 6. Let G(X) by a Cayley graph where each element of X is of even 
order. Then G(X) is Hamiltonian iff G(X) is edge-Hamiltonian. 


Proof. Again the sufficiently is clear. To prove the necessity, let x be any element 
of X. We shall show that G(X) contains a Hamilton cycle with an x-edge. Let C 
be any Hamilton cycle of G(X). If C contains an x-edge, then we are through. 
Assume therefore that C does not contain an x-edge. 
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If the order of x is 2, then C together with all the x-edges of G(X) form a cubic 
Hamiltonian graph I in which all the x-edges are ‘chords’ of C. By Lemma 4, I" 
must contain a Hamilton cycle C’ other than C. Thus, IF (which is also a Hamilton 
cycle of G(X)) must contain a chord which is an x-edge that we require. 

Finally, consider the case when the order of x is 2k where k is an integer 
greater than 1. In this case, the set of all x-edges of G(X) forms an (ECP) D of 
G(X). The Hamilton cycle C together with D gives rise to a subgraph Q of G(X) 
which is a 4-regular Hamiltonian graph. As every even cycle contains a perfect 
matching, $2 contains a perfect matching P whose edges are chosen from cycles of 
D. Hence C together with P form a subgraph 2’ of Q which is a cubic 
Hamiltonian graph. Now as in the first part of the proof, 22’ contains another 
Hamilton cycle with an x-edge, which is also a Hamilton cycle of G(X). As the 
element x of X is arbitrarily chosen, the proof that G(X) is edge-Hamiltonian 
follows from Lemma 1. O 


Combining this and the fact that Cayley graphs of prime power order are 
Hamiltonian (Witte [5]), we have the following immediate consequence: 


Theorem 7. Every Cayley graph of order 2* (k =2, 3,...) is edge-Hamiltonian. 


To end this paper, we wish to point out that from the proof of Theorem 6, it is 
clear that if the question to the problem raised at the end of the previous section 
is in the affirmative, then Conjecture 3 will also be established. 
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1. Introduction 


In 1959 Gallai [4] presented his now classical theorem, involving the vertex 
covering number ap, the vertex independence number Bo, the edge covering 
number a,, and the maximum matching (or edge independence) number f,. 


Theorem 1 (Gallai). For any nontrivial, connected graph G=(V, E) with p 
vertices, 
I. a+ Bo=p 
II. a, + B, = p. 


Since then quite a large number of similar results and generalizations of this 
theorem have been obtained, which we will call ‘Gallai Theorems’. A typical 
Gallai theorem has the form: 


a+B=p, 


where a and f are numerical maximum or minimum functions of some type 
defined on the class of connected graphs and p denotes the number of vertices in 
a graph. 

This paper is an attempt to collect and unify results of this type. In particular, 
we present two general theorems which encompass nearly all of the existing 
Gallai theorems. The first theorem is based on hereditary properties of set 
systems, while the second is based on partitions of vertices into subgraphs having 
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treelike properties. We also present a variety of new Gallai theorems (one of 
which is not a corollary of either of the two above mentioned generalizations), as 
well as a number of other new results. 


2. Gallai theorems for hereditary set systems 


In this section we will prove a general theorem concerning hereditary 
properties of sets, from which one can obtain as corollaries a variety of Gallai 
theorems, including the original Gallai Theorem given above. 

Let S be a finite set of n elements and let P be a hereditary property of the 
subsets of S, i.e. P is a function f from the power set of S to {0,1} such that 
f(X)=1 and X’cX implies f(X’)=1. If f(X)=1 we say that subset X has 
property P and that X is a P-set; if f(X) =0 then X is called a P’-set. 

A transversal of a family ¥ of sets is a set T such that |TM F|=1 for each 
Fe F. 


Theorem 2. Let X cS. Then X is a maximal P-set for some hereditary property P 
if and only if S — X is a minimal transversal of the class of all P'-sets. 


Proof. Let X be a maximal P-set and Y any P’-set. Then YM(S— X) #8, 
otherwise Yc X. But this would imply that Y is a P-set since P is a hereditary 
property. Hence, S — X is a P’-set transversal. It remains to show that S — X is a 
minimal transversal. 

Suppose not. Then for some u e S— X, Z=S— X — {u} is a P'-set transversal. 
But X U {u} does not intersect Z. Hence X U {u} must be a P-set, contradicting 
the maximality of X. 

Conversely, suppose that 5 — X is a minimal transversal of the P’-sets. Then, 
since S — X does not intersect X, X must be a P-set. We must show that X is a 
maximal P-set. Suppose for some u e S— X, X U {u} is a P-set. Then since there 
are no P’-sets contained in XU {u}, S—X —{u} intersects all P’-sets, con- 
tradicting the minimality of S— X. Hence X is a maximal P-set, as required. O 


Note that X is a minimal P’-set transversal if and only if X is a minimal 
transversal of the minimal P’'-sets. 

Suppose now that there is a positive weight w(s) associated with each element 
séS. Define a*(P)(a (P)) be the largest (smallest) sum of the weights of the 
elements in a minimal P'-set transversal; similarly, let B*(P)(B-(P)) be the 
largest (smallest) sum of the weights of the elements in a maximal P-set. The 
following weighted Gallai theorems follow immediately from Theorem 2. 


Corollary 2a. For any hereditary property P of a set system, 
(i) a (P) + B*(P)= Yes w(s) 
(ii) a*(P) +B (P)= Lys w(s) 
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In stating the following corollaries of Theorem 2, we will assume that all 
elements in S have unit weight, so that the sum of the weights of the elements in a 
set X equals the cardinality of X. We note, however, that each of the following 
corollaries can be generalized to the arbitrary weighted case. 

For example, Cockayne and Giles [2] noticed Corollary 2a(i) in the unweighted 
case. 

Hedetniemi {7] also obtained special cases of Corollary 2a(i) when the basic set 
S is either the vertex set V(G) or the edge set E(G) of a graph G =(V, E). 

Part I of Gallai’s theorem can be obtained from the unweighted version of 
Corollary 2a(i). We let S = V(G) and we say that a set X cS has property P if 
and only if (X) the subgraph induced by X is totally disconnected (i.e. X is an 
independent set of vertices). In this case, minimal P’-set transversals are minimal 
vertex sets which cover all minimal non-independent vertex sets, i.e. pairs of 
adjacent vertices. Thus, minimal P’-transversals are minimal vertex covers (sets 
of vertices which cover all edges). In this context, B-(P) = i(G), the independent 
domination number (equivalently, the smallest number of vertices in a maximal 
independent set), while a*(P) = a3(G) is the maximum number of vertices in a 
minimal vertex cover. 


Corollary 2b. For any nontrivial connected graph G with p vertices, 
(i) (Gallai {4}) a+ Bo=p 
(ii) (McFall, Nowakowski {14]) ag + Bo =p. 


Let set § be the edge set E(G) of a graph G, where |E(G)|=q. Let a subset 
XcS of edges have property P if and only if X is independent, i.e. X is a 
matching. A minimal P’-set in this case is a pair of adjacent edges. In this 
context, B*(P) = B,(G), the matching number of G; while B (P) = B; (G) equals 
the smallest number of edges in a maximal matching (i.e. a minimaximal 
matching). The parameter a (P) can in this case be seen to equal ao(L(G)), the 
vertex covering number of the line graph L(G) of G. We have: 


Corollary 2c. For any nontrivial connected graph G with p vertices and q edges, 
(i) (Hedetniemi [7] ao(L(G)) + Bi(G) =4 
(ii) ag (L(G)) + By (G) = 4. 


Many Gallai theorems may be obtained by considering a class of forbidden 
subgraphs, letting S = V(G) (or E(G)) and saying that a set X c S has property P 
if and only if the induced subgraph (X) contains no member of &. 

For example, let $= {K3} and S = V(G). 


Corollary 2d. Let a3 denote the minimum number of vertices covering all the 
triangles of a graph G with p vertices, and let B, denote the maximum number of 
vertices in a set S such that (S) contains no triangles. Then 


ay + B; =p. 
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Our next application for Theorem 2 provides another result on matching and 
introduces a new graph theory parameter called the matchability number of a 
graph. Let S=V(G) and let a set X = {x,, X2,...,X,} GV have property P if 
and only if X is matchable, i.e. there exists a set of t independent edges (x;, f(x;)), 
i=1,2,...,t Let af(a;,) denote the maximum (minimum) number of vertices 
in a minimal transversal of non-matchable sets, and let B*(B~) denote the 
maximum (minimum) number of vertices in a maximal matchable set. Notice that 
B*(P) = B,(G) (the matching number). 


Corollary 2e. For any nontrivial, connected graph G with p vertices, 
(i) a, + Bi =p 
(ii) a+ Bo =p. 


From Corollary 2e(i) and Gallai’s Theorem, Part II, we may immediately 
conclude 


Corollary 2f. For any nontrivial connected graph G. 
On(G) = a,(G). 


The matchability number of a graph, B(G), i.e. the smallest cardinality of a 
maximal matchable set of vertices, appears to be a new parameter. It is in general 
neither equal to 6; (G), the smallest number of edges in a maximal matching nor 
equal to B,(G), the matching number. This can be seen from the graphs G, and 
G, in Fig. 1. It is easy to see, in fact, that for any graph G, 


7'(G) = Bi (G) = B (G) <B*(G)= B,(G) 


where y’(G) is the edge domination number of G. A graph G, with Bj =2< 
B =3 and G, with B’ =2<f,=3, where a B)(G,)-set is {(2, 3), (4, 6)}; and 
a B (G)-set is {2,3,4}; a B°(G,)-set is {3,4} and a £,(G,)-set is 
{(1, 2), (3, 4), (5, 6)}. 

Next, we let S = V(G) and let X c S have property P if and only if X contains 
no closed neighborhood of G. We observe that a minimal transversal of the 
P'-sets is a minimal transversal of the set of closed neighborhoods, i.e. a minimal 
dominating set of G. The cardinalities of a largest and a smallest minimal 
dominating set are denoted I'(G) and y(G), respectively. Let B7(G) and Bz (G) 
denote the cardinality of largest and smallest P-sets, respectively. 
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Corollary 2g. For any nontrivial connected graph G with p vertices, 
(i) y+ Bo =p 
(ii) + Br =p. 


Let e(G) denote the maximum number of pendant edges in a spanning forest 
of G. In [15] Nieminen proved 


Theorem 3 (Nieminen). For any non-trivial connected graph G with p vertices, 
yte=p. 


In {8] Hedetniemi observed a duality between Nieminen’s result and Part II of 
Gallai’s theorem as follows. A spanning star partition is a partition [T= 
{V,, Vo,..., Ve} of Vsuch that for 1<i<k, (V,) contains a non-trivial spanning 
star. Define B7(B,) to equal the maximum (minimum) order of a spanning star 
partition of G. Also let a{(a;) equal the maximum (minimum) order of minimal 
edge cover (a set of edges which covers all vertices of G). 


Theorem 3’. For any nontrivial connected graph G with p vertices, 
(i) (Hedetniemi) a; + BY =p = a, + B, (Gallai 11) 
(ii) (Hedetniemi) af + By =p =e + y (Nieminen). 


Thus we conclude from Corollary 2g(i) and Theorems 2 and 3 that 


Corollary 3a. for any nontrivial connected graph G, 


Bi =e= ay. 


Similar results can be obtained when closed neighborhoods are replaced by 
open neighborhoods. In this case minimial P’-set transversals are minimal total 
dominating sets (cf. Cockayne, et al. {3]). 

As our last illustration of Theorem 2, we let S=V(H), the vertex set of a 
hypergraph H with edge set @. For a given positive integer k, we say that a set XY 
of S has property P if and only if jeNX|<k for each ee. Minimal 
P'-transversals are minimal transversals of the sets of vertices which contain at 
least kK +1 vertices of some edge. The results in Corollary 2a(i) and (ii) in this 
context reduce to Corollaries 2b(i) and (ii) when H is a graph and k = 1. 

Finally, we note that one can prove a Boolean dual of Theorem 2 which may 
have interesting special cases. Let Q be an expanding property on the subsets of a 
given set S, ie. Q is a function f from the power set of S to {0,1} such that 
f(X)=1 and X' 2X implies f(X’) = 1. 


Theorem 2’. A set X cS is a minimal Q-set if and only if S — X is a maximal set 
whose union with any Q'-set is not S. 
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3. Gallai theorems from spanning forests 


An interesting variety of Gallai theorems can be obtained from a very 
elementary observation about spanning forests of a connected graph G. Let F be 
a spanning forest of a graph G with p vertices, let e(F) denote the number of 
edges in F and ¢(F) denote the number of connected components of F (i.e. the 
number of trees in the forest). 


Proposition 4. For any graph G with p vertices and any spanning forest F of G, 
t(F) + e(F) =p. 


Next consider the class of all spanning forests satisfying some property P; let 
such a forest be called a P-forest. Let t(P) denote the minimum number of trees 
in a (spanning) P-forest, and let e(P) denote the maximum number of edges in a 
P-forest. 


Corollary 4a. For any graph G with p vertices and any property P of a spanning 


forest 
t(P) + e(P) = p. 


A number of properties P of spanning forests lead to interesting Gallai 
theorems. 

Let P, denote the property that every tree in a forest has at most two vertices 
(equivalently, has diameter <1). Then it can be seen that ¢(P,)=a,(G), the 
vertex covering number, and e(P,) = 8,(G), the matching number. 


Corollary 4b. For any connected graph G with p vertices, 
t(P,) + e(P,) =a + B, =p (Gallai, ID) 
Next, let P, denote the property that every tree in a forest F has diameter <2. 


In this case every tree in F is a star (K,,,) and it can be seen that 


t(P,) = y, the domination number, and 
e(P,) = €, the pendant edge number 


(the maximum number of pendant edges in a spanning forest). 


Corollary 4c. For any connected graph G with p vertices, 
t(P,) + e(P,) = y + € =p (Nieminen). 


One can observe from Corollary 4c, Corollary 2g, and Theorem 3 that there 
are four equivalent definitions of the domination number of a graph G = (V, E): 
(i) the minimum number of vertices in a set D such that every vertex in 

V — D is adjacent to at least one vertex in D; 
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(ii) the minimum number of vertices in a transversal of the closed neighbor- 
hoods of G; 
(iii) the minimum order of a spanning star partition of the vertices of G (note; 
this does not allow isolated vertices); 
(iv) the minimum number of trees in a spanning forest in which every tree has 
diameter <2 (note: this allows isolated vertices). 
Finally, let P, denote the property that every tree in a forest is a path. In this 
case t(P,) equals the minimum number of paths whose union is V(G), and e(P,) 
is the maximum number of edges in a path decomposition of V(G). 


Corollary 4d. For any graph G with p vertices, t(P,) + e(P,) =p. 


Notice that ¢(P,) =1 if and only if G has a Hamiltonian path. 


4. Other Gallai theorems 


Additional Gallai theorems can be found in the literature which involve a wide 
variety of graphical parameters. In this section we mention a few of these. We 
also include a new theorem, which apparently is not a corollary of any of the 
previous results. 

Let y.(G), the connected domination number of G, equal the minimum number 
of vertices in a dominating set D such that (D) is a connected subgraph. Let 
€7(G) equal the maximum number of pendant edges in a spanning tree of G. 


Theorem 5 (Hedetniemi and Laskar [9]). For a connected graph G with p 
vertices, 


¥(G) + €7(G) =p. 


Let A(G), the edge connectivity of G, be the minimum number of edges in a set 
S, such that G—S is disconnected or K,. Definite B,(G) to be the maximum 
number of edges in a set S that (S) is disconnected. 


Theorem 6 (Hedetniemi {6]). For a connected graph G with q edges, 
MG) + Bi(G) = 4. 


This result follows from Corollary 2a. 

Let A*(G) denote the maximum cardinality of a minimal set of edges S, such 
that G — S is disconnected or K,, and let 8B, (G) denoted the minimum number of 
edges in a maximal set of edges S such that (S) is disconnected. Then the 
following theorem also results from Corollary 2a. 
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Theorem 7 (Peters et al. {16]). For any non-trivial connected graph G with q 
edges, 


A*(G) + Bi(G)=q 


Similar results hold for vertex-connectivity. Let x(G) denote the vertex 
connectivity of G, i.e. minimum number of vertices in a set S, such that G — S is 
either disconnected or K,. Let B,(G) denote the maximum number of vertices in 
a set S such that (S) is disconnected or K,. 


Theorem 8 (Hedetniemi {6]). For a connected graph G with p vertices, 
K(G) + B.(G) =p. 


If we replace k(G) by x*(G) and B,(G) by Bz(G) in the above theorem, we 
obtain another Gallai theorem, where x*(G) is the maximum number of vertices 
in a minimal set S, such that V —S is disconnected or K, and B,(G) is the 
minimum number of vertices in a maximal set S, such that (S) is disconnected. 


Theorem 9 (Hare, Laskar, Peters {5]). For a connected graph G with p vertices, 
k*(G) + B.(G) =p. 


We now introduce two new parameters, a,,(G) and B,,(G) of a graph G and 
prove a Gallai theorem involving these parameters, which apparently is not a 
corollary of any of the previous results. As a matter of fact, if k =1, then Gallai 
theorem II results. 

Let k<6, the minimum degree of G. Definite a,,(G) to be the minimum 
number of edges in an edge set X cE such that, for every ve V(G), deg v in 
{X) is at least k. Define B,,(G) to be the maximum number of edges in an edge 
set X c E such that, for every v e V(G), deg v in (X) is at most k. Note that for 
k=1, a,(G)=a,(G) is the edge-covering number and £,,(G) = ,(G) is the 
matching number of G. 


Theorem 10. For a connected graph G with p vertices and k <6, 
a(G) + Biu(G) = kp. 
Proof. Let X be an a,-set, i.e. X is a set of a,,(G) edges such that for each 
veV(G), degu in (X) is at least k. 
Let A= {u | degu in (X) =k} 
B={v|degv in (X) >k}. 


Let A={vj,v2,...,u,} and B={u,,u2,...,u,}. Suppose the degree of 
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u;€B in (X) is k+A;,, i=1,2,...,¢. Note that each A,;>0. Due to the 
minimality of (X), it follows that B is independent. Let Y denote the set of edges 
obtained from X by deleting A; edges from each u,;é B, i=1,2,...,¢. Then for 
each vu, € A, deg v; in (Y) is at most k, and for each u; € B deg u; in (Y) is exactly 
k. In other words, for each v €V(G)degyu in (Y) is at most k, and hence, 
Bu,.(G) =|Y|. Counting the degrees of each v € V(G) in (X) and (Y) we get 


za 


sk+ > (k+A,) + tk +ks - 
i=1 i=1 
=2k(t+5s) 
= 2kp. 
Hence, the sum of the numbers of edges in X and Y is kp and we have 
a (G) + Bix(G) = |X] + |¥1 = kp. 


To show that a,,(G)+ By,(G) <kp, we let X be a B,,-Set, i.e. a set of By,(G) 
edges such that for each v E V(G), deg vu in (X) is at most k. Let 

A= {vu |degu in (X) =k} 

B= {vu | degu in (X) <k}. 

Let A={v,,vU2,...,u,} and B=({u,, uz,...,u,}. Suppose deg u; in (X) is 
k-A,, i=1,2,...,¢t. Note that A, >0. Due to the maximality of X, it follows 
that if uju; eX, then uju; € E. Now, as before we construct a set Y of edges from 
X so that for each v Ee V(G), degu in (Y) is at least k. This can be done by 
adding A; edges to each vertex u;eB, i=1,2,...,¢, and these edges join 
vertices of B with A. Note that |Y| = a,,(G). Counting the degrees of vertices in 
(X) and (Y) we get 


ks + > (k—-A,) + kt + >) A, + sk 
i=1 i=1 


=2k(s +1£) 

= 2kp. 
Thus the number of edges in X and Y=kp. But, a,,(G) + Bu(G) S|X|+|Y|= 
kp. O 


Corollary 10a. For a connected graph G with p vertices, 
@1(G) + Bi,(G) = a@,(G) + B,(G) = p. (Gallai) 
@1,(G) = a,(G), Bii(G) = B,(G) and 
a+ By =a,+ 8, =1-p=p. 


The following theorem is a natural extension of the Gallai Theorem II. The 
proof is essentially the same as that of Theorem 10. 
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Theorem 11. Let f be an integer-valued function defined on V(G) and suppose 
f(v) <deg(v) for each ve V(G). Define a,;(G) to be the minimum number of 
edges in an edge set X < E(G) such that, for every ve V(G), deg(v) in (X) is at 
least f(v). Similarly B,,(G) is defined. Then 


a(G)+By(G)= > fv) 
veV(G) 


In [13], Laskar and Sherk showed that Gallai theorems can be found in the 
study of projective planes, as follows. 
Let IT denote a projective plane of order n, 
Let P denote the set of points of IT, and 
Let L denote the set of lines of IT. 
For any integer m=1, let P,, denote a set of points of IT, such that for every 
leL, 


J1N P,,| =m. 
Similarly, let M,, denote a set of points of I7, such that for every le L, 
|10 M,,| <m. 


Finally, let a, equal the minimum cardinality of all P,,-sets and let B,, equal 
the maximum cardinality of all M,,-sets. 


Theorem 12 (Laskar, Sherk). 


Om t+ Brsi-m=ntnt+l, 1<m<n. 


We also note that a dual version of Theorem 11 also holds. 

In {10] and {11], Hedetniemi and Laskar propose a bipartite theory of graphs, a 
theory in which many of the standard results in graph theory have counterparts 
for bipartite graphs. Among the results obtained in this bipartite theory are 
several ‘bipartite’ Gallai theorems, as follows. 

Let G =(X, Y, E) denote a bipartite graph where edges only join vertices in X 
with vertices in Y. Given an arbitrary graph G =(V, E) we can construct three 
bipartite graphs of some interest, as illustrated in Fig. 2. 

Let G =(X, Y, E) be a bipartite graph. 


r t b t 1’ 1 1’ a t t a 
G 5 ZN 2 2 2° 2 2° b 2 2 b 
| ag i : : . ; : : : 
4 —4+3 4 4 4 4 d 4 4 d 
5 5° 5 x e€ 5 5 e 
: f 

VV wt EV VE 
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A set Sc X is a Y-dominating set if for every y € Y there is an x € S such that 
(x, y) €E. 

A set Sc X is hyper-independent if there does not exist a y € Y whose open 
neighborhood N(y)c S. 

A set Sc X is hyper-dominating if for every x eX —S there does not exist a 
y € N(x) such that 


N(y)cSU {x}. 


Finally, define 8,(G) to equal the maximum order of a hyper-independent set, 
and yy(G) to equal the minimum order of a Y-dominating set of G. 


Theorem 13 (Hedetniemi, Laskar). For any graph G=(V, E), where |V|= 
and |E|=4q, 

(i) ¥(G) + B(VV) =p 

(ii) y(G) + B,(VV*) =p 

(iii) a\(G) + B,(EV) =4 

(iv) ao(G) + B,(VE) =p 


A variety of new Gallai theorems can be generated from an extension of 
Theorem 2 which involves a generalization of the concepts of minimality and 
maximality. These ideas were introduced in {1]. We omit proofs here. We use the 
notation of Section 2. 

A subset X of S is called a k-minimal P-set if X is a P-set but for all ¢ satisfying 
1<¢<k, all €-subsets U of X and all (¢ — 1)-subsets R of S, (X — U) UR is not 
a P-set. We note that 1-minimality is the usual concept of minimality. 

Similarly X is a k-maximal P-set if X is a P-set but the addition of any ¢ 
elements to X where € <k followed by the removal of any @—1 elements, forms 
a set which is not a P-set. Let P be a hereditary property on S and let the new 
hereditary property Q on S be defined by: 

X is a Q-set if and only if X is a transversal of the class of P’-sets. 


Theorem 14 (Bollobés, Cockayne and Mynhardt). X is a k-maximal poe if and 
only if S— X is a k-minimal Q-set. 


Let a,(S, P) equal the minimum cardinality of a k-minimal P-set, and let 
B,(S, P) equal the maximal cardinality of k-maximal P-set. 


Corollary 14a (Bollobés, Cockayne, Mynhardt). Let P be a hereditary property 
on the subsets of a set S. Then 


a,(S, P) + B,(S, P) =|SI. 


Note that for k =1, P-sets are independent sets of vertices and, again, we get 
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Gallai’s theorem 


Q + Bo =p. 


As a final note, we mention that a matroid generalization of Gallai’s theorem 
has been obtained by Kajitani and Ueno [12]. In particular, they establish a one 
to one correspondence between independent parity sets in a partition matroid M, 
induced by incidences of edges in the subdivision graph S(G) of a graph G, and 
matchings in G. 

Given this correspondence, it is easy to see that the complement of an 
independent parity set in the dual matroid M* of M corresponds to a covering of 
G and vice versa. From this observation the following interesting result can be 
proved, a corollary of which is Gallai’s Theorem II. 


Theorem 15 (Kajitani and Ueno). Let n be the number of vertices of a connected 
graph G =(V, E). 
(i) Suppose that X is a maximal matching of G. If Y is a covering of G which 
contains X and is minimal in the coverings containing X, then 


|X|+|Y| =n. 


(ii) Suppose that Y is a minimal covering of G. If X is a matching of G which is 
contained in Y and maximal in the matchings contained in Y, then 


|X|+|Y| =n. 
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The reliability of a network can be efficiently bounded using graph-theoretical techniques 
based on edge-packing. We examine the application of combinatorial theorems on edge- 
packing spanning trees, s, t-paths, and s, t-cuts to the determination of reliability bounds. The 
application of spanning trees has been studied by Polesskii, and the application of s, -paths has 
been studied by Brecht and Colbourn. The use of edge-packings of s, t-cutsets has not been 
previously examined. We compare the resulting bounds with known bounds produced by 
different techniques, and establish that the edge-packing bounds often produce a substantial 
improvement. We also establish that three other edge-packing problems arising in reliability 
bounding are NP-complete, namely edge-packing by network cutsets, Steiner trees, and Steiner 
cutsets. 


1. Edge-packings of graphs 


An edge-packing of a multigraph G is a collection of edge-disjoint subgraphs of 
G. Many combinatorial problems can be viewed as edge-packing, including the 
chromatic index problem and the independent set problem. In this paper, we are 
concerned with edge-packing problems which arise in problems concerning 
network reliablity. The three problems of most interest to us here are edge- 
packings by spanning trees, by s, ¢-paths, and by s, t-cuts. We first review the use 
of edge-packings in the reliability context. 

A probabilistic graph is a multigraph together with a probability of operation 
for each edge. Vertices of the multigraph represent communication centers which 
never fail, while edges represent undirected communication links which operate 
Statistically independently with the stated probabilities. An all-terminal operation 
in such a graph requires that all vertices be able to communicate with one another 
via paths of operational edges, and all-terminal reliability is the probability that 
the network supports an all-terminal operation. A 2-terminal operation for 
specified vertices s and ¢ requires that there be an operational s, ¢-path, and 
2-terminal reliability is the probability that a 2-terminal operation for s and ¢ can 
be performed. A k-terminal operation for a specified set of k target nodes requires 
that each pair of target nodes be able to communicate, and k-terminal reliability is 
the probability of being able to carry out a k-terminal opeation. The evaluation of 
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each of these three reliability measures is a very difficult problem; in fact, they 
are #P-complete [19, 23]. This had led network designers to consider lower and 
upper bounds on the reliability which can be efficiently computed. The goal of 
this paper is to study the application of edge-packings to the production of 
reliability bounds. We also compare some computational results for the bounds 
with bounds obtained from the other main technique, subgraph counting (see 
[6]). It is also important to remark on the existence of efficient methods for 
combining the results of the edge-packing bounds described here with the 
subgraph counting bounds; see [5, 7]. 

We illustrate the general method for obtaining lower bounds by showing how 
an edge-packing by spanning trees leads to a lower bound on all-terminal 
reliability. Let G be a probabilistic graph whose all-terminal reliability is desired, 
and let {7;,..., 7} be an edge-packing of G by spanning trees. Observe that 
the all-terminal reliability R; of the tree 7; is easily computed; it is just the product 
of the edge operation probabilities of the edges in the tree. Now every state of 
the edges of G corresponds to a state of the edges in each tree; a failed state of G 
must yield failed states in each tree, but the converse need not hold. Thus an 
upper bound on the failure probability of G is the probability that all of the trees 
fail; taking complements of each gives a lower bound on the probability that G 
operates, as desired. The bound will be improved in general by having more 
trees, and by having more reliable trees (these two goals may be conflicting). The 
simplest edge-packing problem of interest here is to pack the maximum number 
of spanning trees. The application to 2-terminal lower bounds is very similar, 
except that we consider edge-packings by s, ¢-paths. In the application to 
k-terminal reliability, we consider edge-packing by Steiner trees, subtrees 
connecting all target nodes in which each leaf is a target node. 

Upper bounds arise in a very similar way. A cutset is any set of edges whose 
removal prevents network operation. Cutsets whose removal disconnects s from ¢ 
are 5, f-cutsets, and those which disconnect the network (i.e. prevent all- 
terminal operation) are called network cutsets. Finally, cutsets which separate any 
two target nodes are Steiner cutsets. To develop an upper bound, consider the use 
of edge-packing by cutsets to bound reliability. Let G be a graph, and 
{C,,..., C,} be an edge-packing by cutsets for G. An operational state of G 
corresponds to a state of each cutset in which each has at least one operational 
edge, although the converse need not hold. Hence the probability that each cutset 
has at least one operational edge is an upper bound on the reliability of G. This 
can be easily evaluated, as the cutsets are edge-disjoint. The bound is improved 
by taking more cutsets, and cutsets which are more likely to fail; again, these two 
goals may be conflicting. We consider the simplest case of requiring an 
edge-packing by a maximum number of cutsets. 

Two main points are at issue here. First, we must consider whether the bounds 
developed thus far can be efficiently computed. Second, we must ascertain 
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whether the resulting bounds are of any practical use—in particular, whether they 
are competitive with the subgraph counting bounds. Edge-packing by spanning 
trees, 5, ¢-paths, and s, ¢-cuts all have efficient solutions, while edge-packing by 
network cutsets is NP-complete (see Section 6). In the last case, however, the 
lack of an efficient edge-packing algorithm is more than compensated for by a 
clever bounding technique due to Lomonosov and Polesskii [14] which employs 
‘noncrossing’ rather than edge-disjoint cutsets; noncrossing cuts can be found 
efficiently using the Gomory-Hu algorithm. Edge-packing by Steiner trees and 
Steiner cutsets are both NP-complete; we establish this in Section 5 and 6. We 
first focus on the three efficiently solvable edge-packing problems. The applica- 
tion of the first two in bounding reliability has been studied previously; we review 
these, and then examine more carefully upper bounds on 2-terminal reliability 
from edge-packings by s, ¢-cutsets. 


2. Edge-packing spanning trees 


Tutte [22] and Nash- Williams [17] first addressed the problem of edge-packing 
a graph with spanning trees; Edmonds [8] subsequently obtained a more powerful 
result which gives conditions under which a matroid contains k disjoint bases. The 
application to edge-packing spanning trees is immediate, as the spanning trees are 
precisely the bases of the graphic matroid of the graph. Polesskii [18] observed 
that the Tutte-Nash—Williams theorem has a particularly useful corollary: a 
graph with edge-connectivity 2¢ or 2¢+1 has at least ¢ edge-disjoint spanning 
trees. This corollary leads to a simple expression for a lower bound when each 
edge has operation probability p: an n-vertex graph with edge-connectivity 2¢ or 
2t+1 has all-terminal reliability at least 1—(1—p”'). This is called the 
Polesskii bound, and is of interest largely because it requires no knowledge of the 
actual spanning trees, or a method to find them—it requires only the number of 
vertices and the edge-connectivity. 

One way to improve the Polesskii bound is to employ the actual number of 
edge-disjoint spanning trees. Fortunately, Edmonds’s proof of the sufficiency of 
the Tutte-Nash-Williams conditions is based on an elegant efficient algorithm, 
the matroid partition algorithm. The matroid partition algorithm can be used to 
construct efficiently a maximum set of edge-disjoint spanning trees. In addition to 
incorporating the actual number of trees, knowledge of the actual trees enables us 
to allow different edge probabilities. 

It is difficult to compare the resulting edge-packing bound with other available 
bounds, to ascertain whether it affords some practical improvement. Two main 
competitors exist: the Ball-Provan bounds [2] are the best subgraph counting 
bounds which apply, and the Lomonosov—Polesskii lower bound [15] also 
applies; however, both assume that all edge-probabilities are equal, while the 
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Fig. 1. 


edge-packing bound requires no such assumption. In fact, the edge-packing 
bound is the only basic bound which applies here when different edge probabil- 
ities are allowed, and hence in this context is very valuable indeed! 

When all edge operation probabilities are equal, it is of interest to compare the 
edge-packing bound to the Ball-Provan and Lomonosov—Polesskii bounds. 
Unfortunately, in our experience, the edge-packing bound fares quite poorly. Let 
us consider a small example on six vertices, with each edge having operation 
probability p, depicted in Fig. 1. 

The three bounds evaluated on this graph yield: 


5 


Edge-packing Dp 
Lomonosov-—Polesskii 6p° — 5p® 
Ball-Provan p[1+4q +7q7 + 10q° + 14q* + 189°] (q=1-p). 


For this example, both bounds always beat the edge-packing bound. This 
unfortunate state of affairs persists in other examples. This appears to be a 
serious setback for the edge-packing approach, and may account for the lack of 
attention afforded it. However, when we turn to the 2-terminal problems, 
edge-packing is a winner. 


3. Edge-packing s, t-paths 


Menger’s theorem [16] guarantees that the number of edge-disjoint s, t-paths 
equals the minimum cardinality of an s, ¢t-cutset, and hence network flow 
algorithms give an efficient method for edge-packing a graph with the maximum 
number of s,¢-paths. Brecht and Colbourn [4] examined the application of 
edge-packings of s, ¢-paths to reliability bounds, and found that the edge-packing 
bounds are not only competitive with, but often much better than, other available 
bounds. Once again, no other available bound allows different edge probabilities. 
When all edge probabilities are equal, the best subgraph counting bound which 
applies is the Kruskal-Katona bound. Even in this case, the edge-packing bound 
is better typically in our test cases. A small example to illustrate this is to take the 
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graph of Fig. 1 with s =0, ¢=5. For various values of p, we find the following: 


P edge-packing Kruskal-Katona exact 

0.2 0.0476800 0.0400041 0.0609183 
0.3 0.114570 0.0900919 0.154691 

0.4 0.213760 0.160786 0.293976 

0.5 0.343750 0.253906 0.464844 

0.6 0.498240 0.373437 0.642341 

0.7 0.664930 0.524589 0.798801 

0.8 0.824320 0.707109 0.913646 

0.9 0.948510 0.896093 0.979453 


For this example, we chose an edge-packing with a path of length two and a 
path of length three. There are other edge-packings;, in particular, for this 
example, one might obtain a path of length four and a path of length five. This 
selection would lead to much poorer bounds, despite the fact that it remains an 
edge-packing with a maximum number of paths. 

One of the main open questions here is which selection of paths leads to the 
best bound. It is easy to see that, when all other things are equal, more paths are 
better. Similarly, if the paths are more reliable (shorter in the equal probability 
case), the bound is better. For this reason, Brecht and Colbourn [4] modified the 
edge-packing bound in two ways. First of all, notice that it is not always best to 
take the maximum number of paths; given a choice between one short path or 
two long paths, the better selection will depend on the value of p. One can 
exploit the structure of network flow algorithms to help out here; flow algorithms 
produce edge-packings by k paths from edge-packings by k — 1 paths, and hence 
one can retain the best bound resulting from all number of paths from 1 up to the 
5, t-edge-connectivity. Another improvement can be made by attempting to 
reduce the average length of paths used; this can be accomplished by using a 
minimum cost network flows algorithm, to produce a set of paths whose total 
length is minimum. Both of these heuristics have been found to be quite useful in 
tightening the bounds [4]. 

Edge-packing is largely vindicated as a bounding strategy because of the results 
for 2-terminal lower bounds. It is interesting to note that the availability of a 
powerful minmax theorem, Menger’s theorem, is key to the success of the 
approach. One of the important areas for study here is the determination of the 
best edge-packing bound; even the mincost strategy is just a heuristic method. 


4. Edge-packing s, t-cutsets 


Edge-packing s, t-cutsets has been examined by Robacker [21], and sub- 
sequently by Fulkerson [9, 10] as a dual of the maxflow-mincut theorem. A 
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characterization of the maximum number of edge-disjoint s, ¢-cuts, and an 
algorithm to find them, is remarkably simple. 


Theorem. The maximum number of edge-disjoint s, t-cutsets equals the minimum 
length of an 5s, t-path. 


Proof. An s, t-cut contains at least one edge from every s, ¢-path. Thus, if there 
is an s, t-path of length J, there can be at most / edge-disjoint s, t-cutsets. In the 
other direction, if / is the length of a shortest 5, t-path, we can efficiently construct 
I edge-disjoint s, t-cutsets as follows. Starting at vertex s, label the vertices (using 
a breadth-first search) with their distance from s. An s, ¢-cutset is obtained by 
taking all edges from vertices at distance i—1 to vertices at distance i. A collection 
of | cutsets is obtained by repeating this process for each i from 1 to J; the | 
s, t-cutsets which result are edge-disjoint. This completes the proof. O 


As in the case of s, ¢-paths, this easy theorem gives a strong minmax theorem 
for edge-packing s, ¢-cutsets. One might therefore reasonably expect the resulting 
edge-packing bound to be powerful. Once again, the only serious competitor here 
is the Kruskal-Katona bound [24]. To effect a comparison, we assume equal edge 
Operation probabilities; the Kruskal-Katona bound requires this. Rather than 
content ourselves with small contrived examples, we compare the two bounds on 
the 1979 Arpanet, a real computer network whose analysis is of practical interest 
[3]; see Fig. 2. This network has 59 vertices and 71 edges. We should remark in 
advance that the s, t-cutsets produced by simple breadth-first search are not 
necessarily minimal, we always modify the edge-packing produced to make the 
cutsets minimal. 

We first consider two nodes which are ‘far apart’, nodes 20 and 41. These two 
nodes are at distance ten, the diameter of the Arpanet. The theorem guarantees 
that ten edge-disjoint cutsets can be found. Labelling the vertices via breadth-first 
search and making each cutset minimal gives the following ten cutsets: 


{15, 20}, {19, 20} 

{8, 15}, {17, 19} 

{5, 8}, {17, 18} 

{1, 5}, {5, 22}, {16, 18}, (18, 18} 

{22, 24}, {1, 2}, {14, 16}, {27, 28}, (28, 55} 

{24,31}, {2, 3}, {2,4}, {13, 14}, (27, 30}, (46, 55}, (53, 55}, (55, 57} 
{31, 33}, (31, 42}, {29, 30}, (43, 46}, {50, 53}, (57, 58} 


{33, 38}, (39, 42}, (42, 45}, {29, 32}, (43, 49}, (40, 50}, (50, 52}, 
{58, 59} 


{38, 48}, (39, 56}, {37, 39}, (45, 47}, (32, 34}, (44, 49}, (49, 51}, 
{35, 40}, (52, 54}, (56, 59} 


{37,41}, {41, 44} 
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Fig. 2. 


Of course, this collection of cutsets is not unique, nor do we have any 
assurance that it leads to the best bound. Nevertheless, we can employ any 
edge-packing to obtain a bound, and this it is of interest to see whether this 
simple strategy for producing an edge-packing leads to a competitive bound. We 
therefore compare the upper bound arising from this edge-packing with the 
Kruskal-Katona upper bound. The resulting bounds for various values of p are 
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summarized below: 


Pp Kruskal-Katona edge-packing 
0.1 0.029209 0.000018 
0.2 0.328144 0.003041 
0.3 0.509623 0.032565 
0.4 0.640000 0.123363 
0.5 0.750000 0.280391 
0.6 0.840000 0.477515 
0.7 0.910000 0.677955 
0.8 0.960000 0.847658 
0.85 0.977500 0.912450 
0.9 0.990000 0.960489 
0.92 0.993600 0.974601 
0.94 0.996400 0.985664 
0.96 0.998400 0.993613 
0.98 0.999600 0.998401 


The edge-packing bound is dramatically better than the Kruskal-Katona bound 
for this example. Of course, the Kruskal-Katona bound is hampered in this case 
by the fact that the edge-connectivity is only two; in addition, the edge-packing 
bound is assisted by the fact that the nodes chosen are at great distance. For this 
reason, it is worthwhile to examine another example; to obtain favourable 
conditions for the Kruskal-Katona bound, we select nodes which are highly 
connected. To obtain unfavourable conditions for the edge-packing bounds, we 
select nodes which are close together. Consider taking s = 24, t= 31. Only one 
cutset can be included, and it has size 3 at least. Naturally, we expect a much 
poorer bound as a result (although one must be careful, as the reliability of 
communication between these two nodes is actually much higher than the 
previous pair used). This second example is of interest primarily because it seems 
to be the case where the Kruskal-Katona bound is the best in comparison to the 
edge-packing bound. 

Remarkably, while the Kruskal-Katona bound does ‘win’ for small values of p 
(p <0.18), past this point the two bounds agree to six digits of accuracy. This is 
quite remarkable, as one might expect the easily evaluated edge-packing bound 
to be very poor compared to the more sophisticated Kruskal-Katona bound. In a 
sense, the results here reflect on the poor quality of the 2-terminal upper bounds 
from the Kruskal-Katona method. Nevertheless, they do suggest that edge- 
packing is not only a viable competitor, but is a practical improvement. Our 
purpose here is to illustrate that 2-terminal upper bounds from edge-packing are 
just as promising as the useful 2-terminal edge-packing lower bounds. However, 
much research remains to be done in this area. Primarily, one wants a 
characterization of which edge-packings by s, t-cutsets lead to the best bound, 
and efficient algorithms to find such edge-packings. 
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Turning to k-terminal lower bounds, the situation is not as satisfactory; we 
show here that edge-packing by Steiner trees is NP-complete. The Steiner tree 
problem is a well-known NP-complete problem [11], in which one is required to 
produce a Steiner tree with a bound on the number (or total weight) of edges 
used. Edge-packing Steiner trees carries no such stipulation, and hence its 
NP-completeness does not follow trivially from the NP-completeness of the 
Steiner tree problem. Nevertheless, we are able to prove NP-completeness via a 
transformation the NP-complete problem of determining the chromatic index of a 
graph. 


Theorem. Let G be a graph, and T a set of target vertices of G. Determining 
whether G has an edge-packing of at least k Steiner trees for T is NP-complete for 
any fixed k >2. 


Proof. Membership in NP is straightforward; one need only guess k edge-disjoint 
subgraphs, and verify that each is a Steiner tree for 7. Completeness is shown 
using a reduction from chromatic index of k-regular graphs, which is NP- 
complete for every fixed k >2 [12, 13]. 

Let G=(V, E) be a k-regular graph, an instance of the chromatic index 
problem. Let V={ui,...,u,}, E={e),...,em}, and F={fi,...,f,,}. Form 
a graph H as follows. The vertices of H are the elements of V, E, and F together 
with {c,,...,¢,} and a distinguished element root. The edges of H are as 
follows: 

1. for each e = {v, w} in E, {e, v} and {e, w} are edges. 
2. for each i, {e;, f;} is an edge. 

3. for each e,, c;, {e;, c;} is an edge. 

4. for each c;, {root, c;} is an edge. 

Finally, the target set T consists of V together with root. We claim that H has 
an edge-packing by & Steiner trees for 7 if and only if G is k-edge-colourable. 

Suppose G has chromatic index k. Each colour class in the edge-colouring 
forms a 1-factor of G; suppose that colour i contains the edges with indices J. 
Then form a Steiner tree S, by taking all edges induced on the set of vertices 
consisting of root, c;, all elements in E and F with indices from J, and all elements 
of V. The set of vertices so defined induces a Steiner tree on the target vertices, 
since the edges in a colour class share no vertices. Moreover, since every two 
colour classes are edge-disjoint, so are the Steiner trees produced. 

In the other direction, suppose that H has an edge-packing of k Steiner trees 
for T. Each element of V has degree k in H, and appears in each Steiner tree. 
Hence, each has a unique neighbour in each Steiner tree. In total, there are 4kn 
edges from elements of E to elements of F, which we call cross edges. Now since 
each element of V has a unique neighbour in E, at least 4n cross edges appear in 
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a Steiner tree. Since there are k edge-disjoint Steiner trees, each must therefore 
contain precisely 4n cross edges; in fact, the Steiner trees partition the cross 
edges. Consider a class in this partition. It consists in G of a set of edges which 
meets every vertex—that is, a 1-factor. But then the Steiner trees give a partition 
of G into k 1-factors, which is just a k-edge-colouring. This completes the 
proof. O 


The NP-completeness here severely limits the use of edge-packing bounds for 
reliability in this case. One could, of course, settle for edge-packings of close to 
maximum size; no research along these lines seems to have been undertaken. 


6. Edge-packing network cutsets 


In this section, we consider edge-packing the maximum number of network 
cutsets. Since this is the special case of edge-packing Steiner cutsets in which all 
vertices are targets, an NP-completeness result here establishes NP-completeness 
for edge-packing Steiner cutsets as well. It is instructive to first observe two lower 
bounds on the maximum size of an edge-packing. Every s, ¢-cutset is a network 
cutset, and hence an edge-packing by s, ft-cutsets is also an edge-packing by 
network cutsets. This holds for any choice of s and ¢t, and hence a lower bound on 
the maximum size of an edge-packing is the diameter of the graph. 

Another lower bound results from considering independent sets of a graph. For 
each vertex of a graph, define the star of the vertex to be the set of edges incident 
with the vertex. The star of a vertex is a network cutset. Taking stars of all 
vertices in an independent set gives an edge-packing by network cutsets, and 
hence the maximum size of an edge-packing is at least the maximum size of an 
independent set. Since the independent set is NP-complete, the NP-completeness 
of edge-packing network cutsets would be trivial if these numbers always agreed; 
however, there are graphs whose diameter exceeds the size of their maximum 
independent set (paths, for example) and these graphs have maximum edge- 
packings whose size exceeds the size of their maximum independent set. 
Nevertheless, we can use a simple transformation to reduce independent set to . 
edge-packing network cutsets: 


Theorem. Determining whether a graph G has an edge-packing by at least k 
network cutsets is NP-complete. 


Proof. One can guess a set of k edge-disjoint subgraphs, and verify that each is a 
network cutset, and all are edge-disjoint; hence the problem is in NP. 
Completeness is shown by a reduction from determining whether a graph has an 
independent set of size at least k, a well-known NP-complete problem. 

Let G, k be an instance of the independent set problem. Transform G to 
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construct a new graph H; H is a copy of G, with a new vertex dom added which is 
adjacent to all of the other vertices. We claim that G has an independent set of 
size at least k if and only if H has an edge-packing by at least k network cutsets. 
First notice that the size of a maximum independent set in G is the same as that in 
H. 

Now suppose that H has an independent set of size k. Taking the stars of these 
k vertices gives an edge-packing of H by k network cutsets. 

In the other direction, let H have an edge-packing by k network cutsets. 
Without loss of generality, each cutset is minimal. Hence, each cutset partitions 
the vertex set of H into two connected pieces. For the ith cutset, let X; denote the 
vertices in the resulting connected piece which does not contain dom. Since the 
network cutsets are edge-disjoint, no two of the {X,} share a vertex. In fact, no 
vertex in X; has a neighbour in any X;, unless i=j. But then selecting a vertex 
from each X; gives an independent set of size k, as required. This completes the 
proof. O 


Once again, the NP-completeness here limits the applicability of the edge- 
packing bounds. In this case, however, two methods have been proposed for 
avoiding this complexity. One method is to employ ‘noncrossing’ cutsets, rather 
than edge-disjoint ones; here, an efficient algorithm, the Gomory—Hu algorithm, 
does exist. The application to reliability, due to Lomonosov and Polesskii [14], 
gives useful bounds. A second method is to transform the undirected problem to 
a directed one, and to use arc-disjoint directed cutsets; see [20]. 


7. Concluding remarks 


The main conclusion is that edge-packing provides a viable strategy for 
producing reliability bounds which are competitive with the best subgraph 
counting bounds. In the all-terminal case, the presence of a very powerful 
subgraph counting method combines with the lack of “dense” edge-packings to 
make the edge-packing bound relatively poor. However, in 2-terminal lower and 
upper bounds, a weaker subgraph counting method combined with very powerful 
minmax results on edge-packing make the edge-packing bounds typically better. 
Improvements in the subgraph counting methods seem unlikely to compensate for 
the big improvement which the edge-packing bounds sometimes afford. 
Nevertheless, an analogue of the Ball-Provan bounds for the 2-terminal problems 
would be very useful. 

In conclusion, we should also remark that the edge-packing and subgraph 
counting methods both apply to reliability problems on directed graphs as well. In 
fact, any undirected reliability problem can be easily transformed to a directed 
one [1]. One might therefore examine transforming an undirected problem to a 
directed one and then employing arc-packing bounds on the result. In the 
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2-terminal problems, the bounds are identical. However, in the all-terminal 
problem, a substantial improvment in the all-terminal bounds is obtained; this is 
explored further in {20]. 
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Let i and j be distinct positive integers. Let RG,,; be the bipartite graph whose vertices are 
the i- and j-subsets of {0,1,...,i+j—1} and whose adjacency is given by inclusion. We 
consider an Erdés Conjecture that asserts that the graphs RG,, are hamiltonian. In this paper 
Hamilton cycles in RGg ;9 and RG; are constructed by means of techniques involving group 
actions on graphs and their quotient graphs. 


1. Introduction and background 


Let i, j€ Z with O<i <j and let n =i +j. Let V,; be the family of (i + /)-tuples 
over Z, = {0, 1} whose Hamming weight is either i or j. Let RG; ; be the bipartite 
graph whose set of vertices is V,; with adjacency given by changing exactly j —i 
coordinates of value 0 to value 1 or viceversa. For j =i + 1 we have treated in [1] 
and [4] the hamiltonicity problem on RG; ;,,, which is an Erdés Conjecture on 
subsets of {0 ---i+j— 1} in its vectorial version. 

If I is a group then a group action t:! X G—G on a graph G =(V, E) is 
defined in [2] as a pair (T:.X V-V,¢:I X EE) of compatible actions. In 
fact RG=RG;; admits some group actions that ease the study of their 
hamiltonicity. To describe them it is enough to indicate their vertex set action 
components: 


(0) Back-reading of coordinates: 
To: Z, X RG RG, 
where 7(1, a) = (4;4;-1,---, 0), for a=(ao,... , @;4;-1) and le Z,. 
(1) Complementation: 
T,:Z,X RG> RG, 
where 7,(1, a2) =(1 +4 ,...,1+4;4;-1), where a and 1 are as before. 
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(2) Rotation of coordinates: 

T,:Z;4; X RG—> RG, 
where 7)(4, 4) = (@_,, Q-y4i,---» 4-~-1), for every p € Z;,; and a as above. 
(3) r-interleaving of coordinates, where 1<reZ and g.c.d. (r,i+j)=1: 

Ty: Zei4j)X RG RG, 


where ®(i+/) is the Euler totient and may be replaced by the order k of 
rmodi+j, i.e. the lowest positive integer k such that r*=1 (modi+/j). For 
example, 7;(—1, a) = (ao, a,,..., @_,). 

Let 7,:Z, X RG—RG be the action obtained by composition of 7) and 7,, 
that is back-reading plus complementation. In [1] we showed that there exists a 
quotient graph RH; ;,; of RG; ;,, such that any Hamilton path bearing some mild 
conditions can always be lifted to a Hamilton cycle in RG,;,;. The mild 
conditions are: 

(a) Each (resp. one) end of the Hamilton path in RH,;,, has at least one loop 

(resp. two loops). 

(b) If n =2k + 1 is composite, the Hamilton path passes through a 2-link. 

We will call such a path a cyclable path. 

In fact we define RH,,, as the quotient graph of RG; ; under the actions 1), and 
T. 


Example. Represent vertices in RG; ; as (i + j)-words in the alphabet {0, 1}. If a 
is such a word let (a) be its orbit in RG; ;/t2, that is (a) is the circular vector given 
by a. Let us represent elements in RH,;=RG;,;/to1, t2 by circular vectors of 
Hamming weight j. Thus for i = 2 and j =3 we have RH, as represented in Fig. 
1(a) where each vertex has two additional loops. The induced adjacency in RG, 
goes as follows: Consider for example the vertex (11100). Select a coordinate with 
value 1. There is an adjacent vertex to (11100) that reads backward as follows: 

(a) Write down again 1 for the chosen coordinate. 

(b) Write down all the other values changed, for the remaining coordinates. 

(c) Do not forget that the result has to be read backwards, or better be written 

down backwards. 

If this induced adjacency is modified by forgetting the back-reading, then we 

get the graph H,, of circular vectors of the Kneser graph K.,. Its lifting G,5 


¢11100) €11010) 
(11100) €11010) ~< | 
€11000) €10100) 
(a) (b) 
Fig. | 
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11100 11010 11001 10101 10011 01011 00111 10110 01110 01101 


11000 10100 10001 01001 00011 10010 00110 00101 01100 01010 


Fig. 2. 


amounts to changing the adjacency of RG,, by means of the back-reading of 
adjacent vectors. 

RG, ,/T, is a lifting of RH2, represented in Fig. 1(b), where the 2-links are to 
be noticed. For example the 2-link on the left comes from the fact that 11100 is 
adjacent to both 11000 and 01100. A cyclable path in RH,,; is obtained easily and 
lifts RG, 3/t2 to the Hamilton cycle in RG, represented in Fig. 2. Note that a 
pattern is repeated five times by only rotating uniformly the coordinates one 
position to the left each time, where the position previous to the Oth coordinate is 
the (+ j)th one. Also note that the vertices in the Hamilton cycle above were 
arranged in such a way that the pair of vertices in each column is an orbit of 7. 
This disposition makes the whole graph 11100 symmetric with respect to an 
horizontal middle line. The same property holds for any RG,,, see [2] Theorem 
2: 

In [3] the method described in the example was applied and Hamilton cycles 
were found this way for i=j —1<8, and for odd i=j —2 <6. Variations of this 
method produced in [3] Hamilton cycles in RG,;, for even i=j — 2 <6 and for 
i=j-3=3. 

In (1] a characterization of 2-looped vertices and 2-links in RH, _;.1 was given. 
For example (11---10---0) is a 2-looped vertex in RH;.i4:. Also the other 
elements in the orbit of (11---10---0) by the r-interspersion action 7, are 
2-looped vertices. Note that they are also back-reading symmetrical or palindro- 
mic, that is invariant under 7. In general, (11---10---0)¢€RG,; will be a 
(j —i + 1)-looped vertex, We choose these looped vertices generally as the ends 
of our cyclable paths. 

In Section 2 a cyclable path in RHg,19 found by Cordova for his Master Thesis 
is given. A convenient variation of the described group action approach allowed 
Quintana to find a Hamilton cycle in RG, for his Master’s work and this is given 
in Section 3. 


2. A cyclable path in RGo 10 


If t is a F-action on G, then there is a quotient graph G/t = G/T =(V/T, E/t) 
which is a /abelled directed quotient graph as follows: 
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(1) Each element A of the V/T is labelled with the cardinality |A| of it as an 
equivalence class in V. 

(2) In each A € V/T select a main representative A.Q« A of A. 

(3) Transform each edge é € E/t into an arc e* with arbitrary orientation. If 
the arce* of an edge €€ E/t goes from A to B, then label é@ with the 
integer z € (0, gcd{|A|, |B|}] such that the unique representative e € E of é 
with an end A.0 has B.z = T(z.1, B.O) as its other end, where 1 is the unit 
generator of I. 

A path in RG,,/t2, made up by two zigzag crossed nonhorizontal paths joined 

by a horizontal link, or any of its liftings, in RG,,, is called a spring. 

The vertices of RG; ;/T, (resp. RH;,;) are the circular vectors arising from the 
vectors in C,UC,, (resp. C;). A circular vector will be given as a word in 
parentheses, so that if (A:0= (aga, ---a,-1), A: 1=(a,a2°-+-+a),...,As(n- 
1) = (@,-1€y** * Qn,—2), then (A:0) =(A:1)=---=(A:(n—1)). 

To determine Hamilton cycles in RG, ; we will consider the quotient graph 
RJ,,, = RH,,/T;. We denote any vertex A=[a]eRJ;,, for ae RG, ,;, as to have 
main representative A.0=(a)¢RH;,; We are interested in the 7;-orbit car- 
dinality (oc) of each element of RJ;,;. In (the subset of palindromic vertices of) 
RJ,,,1:, there is a maximal oc, denoted (pmoc(n)), resp. moc(n). According to 
whether Tp is a restriction of t3 or not, pmoc(n) = 5moc(n) or pmoc(n) = moc(n), 
for every positive ie Z. In [3] we found it convenient to try to find for i>4a 
subgraph of RJ; ;41 consisting of: (a) Either a collection {P} of disjoint paths with 
the ends, (resp. inner vertices), of each P having oc = pmoc(n), (resp. moc(n)); 
(b) Or a collection {5} of disjoint cycles on vertices with oc = moc(n). Each P as 
in (a) can be blown up to a cycle S = S(P). By breaking up the S(P)’s into paths 
and plugging to their ends the remaining vertices, we constructed cyclable paths, 
for i =5, 7, 8. For i=6, a similar method was utilized out of (b). For all of this, if 
S=(...,A.5, B.s,...) is oriented, then we denote “A.s=(B.s)"') for S— 
(A.s, B.s) as an oriented path from A to B. The path obtained by deleting (resp. 
considering only) from A.s, its last y vertices is denoted by A sy.s, (resp. 
A y;.5). This is the *;’ convention which will be used in this section and in the 
following one. 

A path p in RH,, is given by its first vertex ®(P)=P, and the sequence 
n(P) = {2,(P)} of the F’-coordinates (tuples in Section 3), that remain unchanged 
from the sth vertex of P to the (s + 1)th one. 

A collection of paths in RJo19 as in (a) above is constituted by the following 
paths P, Q,..., Z where the end vertices have oc = 9 and the inner vertices have 
oc = 18: 


@(P) = [1111111111000000000],  (P) = {1, 18, 9, 17, 8, 16, 7, 14, 6, 13, 14, 
6, 16, 8, 2, 1, 6, 2, 1, 12, 2, 6, 13, 10, 4, 7, 17, 9, 5, 4, 6, 17, 10, 2, 9, 14, 0, 16, 
3, 10, 17, 1, 2, 0, 4, 3, 8, 6, 10, 5, 16, 11, 14, 7, 1, 4, 0}; 
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@(Q) =[1011111111010000000],  (Q) = {5, 18, 0, 10, 11, 0, 10, 17, 18, 12, 
17, 10, 2, 13, 4, 15, 3, 5, 8, 11, 5, 17, 11, 8, 0, 5, 7, 11, 9, 7, 5, 16}; 

@(R) = [0111111110010000010], (R) = {4, 0, 3, 12, 17, 16, 2, 9, 7, 3, 12, 
13, 1, 12, 5, 10, 6, 15, 10, 14, 9, 10, 11, 17, 10, 11, 12}; 

(S) = {0111111110000101000], — «(S) = (3, 10, 4, 0, 6, 9, 1, 17, 2, 6, 10, 3, 
7, 12, 5, 2, 0, 7, 13, 0, 9, 4, 0, 13, 17, 11, 6, 1, 4, 5, 1, 6, 5, 18, 8 1, 6}; 
@(T) = [1111011111000000001],  (T) = {2, 17, 3, 14, 0, 4, 6, 15, 4, 12, 18, 
13, 14, 2, 9, 0, 12, 6, 15, 3, 8, 9, 1, 10, 6, 18, 7, 4, 3, 7, 5, 8 7, 5, 10, 12, 0, 14, 
4}; 

(U) = [0111111110001000100], (U) = {7, 0, 5, 13, 0, 5, 16, 7, 1, 0, 6, 14, 
4, 6, 0, 17, 13, 9, 15, 3, 11, 5, 13, 11, 1, 9, 4, 13, 3, 6, 14, 7, 10, 15, 9, 14, 5, 9, 
13, 3, 18, 13, 6, 8, 15, 16, 4, 11, 1, 0, 12, 15, 18, 12, 17, 4, 5}; 


(V) =[1011111101000101000],  2(V) = {6, 1, 5, 12, 13, 16, 7, 14, 15, 8, 2, 
10, 16, 13, 14, 5}. 
The other vertices of RJ 19 are: 
oc = 6: W = [1011011111000010010); 
X = [1101011111001000010]; 
Y = [1011110110100100010); 
oc =2: 2 =[1011001111010100001). 
The following paths exist for O<s <9: 
o.s =(X.s, Y.(s—1)), as =(Vjo.(s + 2), “To.5, Uny.s, W.(s — 2)), 
B.s =(‘a.(5 +3), Pao-(s +5), Ris-5, Qu.(s + 3)), 
(s taken in Z, or Zo). 


A cyclable path in RHg i9 is given as follows: 


(P,.0, Q).0, 53.6, Tog-1, Vi33;.7, “To.5, Ung.5, W.3, 4, 3, 8, O.7, 0.6, 
“Vio39.7, 3.0, Vig.4, “Vig63-7, 2.1, Vig.2, Sig-0, Uso.2, Ris-1, Qio-4, B.2, B.3, 
B.7, B.5, B.0, *0.4, “Py26;.6, Rs.8, “Rig-4, Th7-2, “So7-7, Vo.3, “Sg.2, 
Uy9.2, 6-7, Si28, Soy, “Py.2, T.0, “Rur-6, “Sar-4, Qs1-2, “Pao-4, 
Usg.1, “Siz-3, “S375, Py1326.6). 


3. A Hamilton cycle in RG,, and a simpler algorithm 


If j=i+2, then 7, is free if and only if either i#2i’, or j#2j', where 
i', j' € Z. (Otherwise, for i = 2i’ and j = 2;’, there is a copy of RG,.;, inside RG, ; 
by means of the graph monomorphism (F, f): RG;.;, = (V'E') > RG, = (V, E), 
given by F(c) =(c, c) and accordingly for f). 
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For RG,», 3-interspersion makes t) independent from t; and moc(16) = 
pmoc(16) = 4. This means that nonpalindromic vertices in RJ, appear in pairs 
+A, —A, such that —A = <(+A) = 7,(1, +A). For nonpalindromic vertices we 
always use a sign, while palindromic vertices do not have a sign (compare [3], 
Example i = 7). 

As an example of a simpler algorithm due to Quintana than the one used to 
obtain the Hamilton cycle above we now construct a Hamilton cycle for the graph 
RG,» by a Slightly different procedure. We redefine RH; = RG/t,, t2. Trying to 
find a cyclable path Q starting off at [11---10---0] and ending up at another 
convenient looped vertex in this redefined RH,,; amounts to be working leaving out 
rule (c) of back-writing for the induced adjacency. This will only affect the effort 
of lifting $2 to a Hamilton cycle in RG, ; as follows: Representing our graph as in 
the Example for RG, ; given above, the vertical links that completed a Hamilton 
cycle in RG, ;/t, with the two corresponding zigzag liftings of 92 change their 
meaning. In fact [11--- 10---0] was previously adjacent to [1---10---00] bya 
vertical link and this was part of the desired Hamilton cycle. Now again we use 
the two zigzag paths but one of them is coordinate-rotated as follows: If 
B = 1111111110000000 in V(RG,.) assume that [B] is in the top vertex part of 
RH,» (following the pattern given for RH, ). Then the other end of the zigzag 
path 42;,) starting at [B] is also in the top part. The other zigzag path 92,,; has its 
end vertices at the lower vertex part and one of them is [A], where A= 
0000000001111111. We lift £2;5, and 92,,; in all the possible ways to RG, but the 
additional edges lifting from vertical edges in RG,/t, now are obtained 
differently. In fact B is adjacent not to A, the main representative of [A], but to 
T,(7, A) = 1111111000000000. Then the zigzag path 92, containing B remains as is 
and is joined through the edge (B, 7,(7, A)) to the zigzag path 7,(7, £2,), where 
Q2, is the zigzag path containing A, to perform a spring path 92,5. We will show a 
cyclable path 92 in RG,» containing the image of 924,. The free end of 
T,(7, 24) S 245 in the lifting of 2 is adjacent to another representative of [B]. 
This allows the formation of a pattern having continuation by uniform coordinate 
rotation up to sixteen times. Finally the completion of the desired Hamilton cycle 
in RG, is obtained. 

The graph RJ; 9/t) determined by r = 3 and 13: Z, X RG; »>— RG, has its set of 
vertices divisible in the following classes: 

P One palindromic vertex P with oc = 1. 

Q: Two nonpalindromic vertices Q with oc = 1. 

R: Three palindromic vertices R with oc = 2. 

S: Fourteen nonpalindromic vertices § with oc = 2. 

T: Three nonpalindromic vertices with oc = 2 and 7;(—1, T) = 7)(1, T). 
U: Seven palindromic vertices U with oc = 4. 

V: Seventy three nonpalindromic V with oc = 4. 

W: Seven nonpalindromic W with oc = 4 and 7;(—2, W) = 7%(1, W). 
X: One nonpalindromic X with oc = 2 and 73(—1, X) = 7,(4, X). 


Two Hamilton cycles in Kneser graphs 69 
All the representatives of V-vertices in RH, 9/7 form a path V in RJ; with: 


&(V) = (0101100111000111], — 7(V) = {(8, 9), (0,2), (3,4), (3,10), (0, 10), 
(6, 8), (8, 14), (10, 11), (13, 15), (4, 8), (3,8), (3, 13), (2,7), (5, 15), (1, 3), 
(1, 14), (13, 15), (2, 7), (1, 11), (11, 15), (7, 14), (1, 13), (2, 13), (2, 3), (11, 15), 
(11, 13), (5, 6), (7, 8), (8, 13), (0, 5), (0, 2), (9, 14), (4, 11), (11, 13), (1, 13), 
(0, 12), (5,7), (2,6), (2,10), (5, 10), (9, 14), (1, 2), (1, 6), (6, 14), (12, 14), 
(1,9), (2,3), (3,14), (10, 11), (2,4), (2,3), (11, 13), (0,1), (2, 10), (6, 11), 
(0, 15), (4,5), (3, 8), (0, 14), (6, 11), (10, 11), (1, 11), (9, 11), (5, 12), (1, 6), 
(4, 10), (10, 15), (6, 10), (3, 13), (3, 11), (3, 8), (0, 13)}. 


The path V is in fact a cycle. Moreover, V lifts to either cycles +V.i in RH) 9, 
for 0 <i $3. 

The following path Y, containing twenty three vertices, has its ends in class R, 
the first seven vertices after each end in class S and the other vertices in class W: 


#(Y) = (1011111010010100], — x(¥) = {(3, 11), (7, 15), (5, 13), (3, 11), (2, 8), 
(5, 13), (4, 6), (9, 10), (3, 15), (12, 14), (10, 0), (2, 8), (7, 11), (4, 6), (9, 14), (3, 11), 
(6, 8), (10, 15), (4, 12), (7, 8), (8, 13), (8, 0)}. 


Class U forms its own path F: 


@(U) = [1110101011100100],  7(U) = {(1, 9), (3,7), (2,8), (12, 14), (4, 6), 
(11, 15)}. 


The remaining vertices are going to be used to join liftings of the paths 
described above. They are: 


, =(1111101001011000],  Q, = [1010111100001101), 
T, =(1110110011100100], 7) = [1110110011001100], 
T, = (1110110001101100],  P, = [1101010101010101], 

X, =(1110100111001001], Rs =[1110101111000001). 


The claimed cyclable path in RH79 is: 


(+V;65.0, U.2, (+V65;.0);31.0, +Y;20.1, R31, +¥20;.1, (—¥31.1)7751.1, 
(+V31;.0);18.0, +7.0, (+V318.2)7'38.2, (U.0)~, (+V;65.2)"",  —V.2, 
(+V18;.2)"!, —7;.0, +V18;.0, (—V18;.0)', +75.0, +V18;.3, (—V.3)7', 
+V:65.3, U.1, (+V65;.3);31.3, +Y;20.0, R3.0, (+¥20;.0);18.0, —7T%.0, 
(+V1;.0);15.0, P,.0, (+V15;.0);7.0, -—Q,.0, —Q».0, (+V7;.0);1.0, X;.0, 
+V1;.0, —X,.0, ((—V;1.0)17;.0)7', 7.0, (—V318.0)7';1.0, (+V31;.3);18.3, 
—T,.0, (—V;18.0)~';49.0, +Q2.0, —V24;.1, (+V65;.1)74, (U.3)7', (+V 565.1)", 
—V;24.1, +Q,.0, (—V;24.0)~'). 
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Note added in proof 


We have learned recently that the Conjecture treated in this paper and 
attributed to Erdés appeared for the first time in written form in J. Havel, 
Semipaths in directed cubes, in Graphs and Other Combinatorial Topics, B.59, 
Teubner Texte zum Mathematik (Teubner, Leipzig) (3igme Symposium Tcheco- 
slovaque de Th. graphes, Nov. 1982). 
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An alternative and simpler proof of the following result is given: Every r Xs generalized 
partial latin rectangle Q on A= ({t,2,...,k} can be extended to an n Xn generalized latin 
square on A if and only if n=r+s—min{M(i)|ie A}, where N(i) denotes the number of 
times that the symbol i appears in Q. 


1. Introduction 


A latin rectangle of size r Xs on a set of k symbols 1, 2,..., k is anr X 5 matrix 
in which every entry is an integer between 1 and k and each integer occurs at 
most once in each row and each column. In [5], Ryser proved that every r xs 
latin rectangle Q can be extended to a k Xk latin square on the same set of 
symbols 1, 2,..., if and only if each symbol occurs at least r +s — k times in 
Q. 

In this paper, we study a generalized version of latin rectangles which was 
studied by Andersen and Hilton [1-4]. In Theorem 2.3, we prove that every 
m Xn generalized latin rectangle can be extended to form an n Xn generalized 
latin square on the same set of symbols. We also prove that every rXs 
generalized partial latin rectangle Q on A = {1,2,..., k} can be extended to an 
n Xn generalized latin square on A if and only if n=r+s—min{N(i) |i € A} 
where N(i) denotes the number of times that symbol / appears in Q. Our proof is 
elementary and different from the one given by Hilton [4], who made use of de 
Werra’s theorem on balanced edge-colouring of bipartite graph [6]. In the last 
section, we illustrate by an example the two different approaches. 


2. Generalized latin rectangle 


Definition 2.1. Let L =(L,) be an m Xn (m <n) matrix and A = {1, 2,..., k} 
satisfying the following: 

(a) L, cA for all 1 <i<m,1<j<n 

(b) @#U (L,|f=1,2,...,n} cA for all 1<i<m 

(c) B#U{L,|i=1,2,...,m} cA for all 1<j<n 

(d) L, NL, =@ for all p#q, 1<p,q<n 

(e) L, NL, =9 for all r#s, 1<r,s<m 
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Then L =(L,) is called a generalized partial latin rectangle on A. A generalized 

partial latin rectangle L = (L,) on A is called a generalized latin rectangle on A if 
forall<i<m, U{L,|j=1,2,...,n} =A. 


L=(L,) is called a generalized latin square on A if 
(a) L=(L,) is a generalized latin rectangle on A 
(b) m=n 
(c) U{L,|é=1,2,...,m}=A forall l<j<n 


Definition 2.2. Let L=(L,) be an mxXn (mn) generalized partial latin 
rectangle on A. Anr Xs (r =m, 52n) generalized latin rectangle L* = (Lj) on 
B, BDA, is called an extension of L if 


Li =Lyforali<i<m, 1<j<n. 


If L has an extension L*, then we say that L can be extended to L*. 


Lt 
| 


18 U8 
Diagram of L*. 


Theorem 2.3. Every m Xn (mn) generalized latin rectangle L=(L,) on k 
symbols 1,2,...,k can be extended to form an n Xn generalized latin square 
L* = (Lj) on the same set {1,2,..., k}. 


Proof. Let L=(L,;) be an mxXn (mn) generalized latin rectangle on 
{1,2,...,k}. Form a set of m X n (0, 1)-matrices P,, P,,...., P, from L =(L,) 
as follows: 

For all l<i<m, 1<j<n, 1<t<k 


1 ifteL, 


i, j) entr of R= | 
Gi) : 0 otherwise. 


Since L =(L,) is a generalized latin rectangle, each symbol ¢ (t=1,2,...,k) 
appears exactly once in each row and at most once in each column of L. It follows 
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that each P, is an m X n (0, 1)-matrix with only one 1 in each row and at most one 
1 in each column. 
Augment each P, into an n Xn matrix P** where (i, j) entry of 


p* =fe entry of P, forall l<i<m,1<j<n 
0 otherwise. 


It follows that each P* has (n —m) zero rows and (n —m) zero columns. For 
each P*, arbitrarily match each zero row to a zero column. Indicate the chosen 
matching by assigning value 1 to (i,j) entry of P? if row i of P/ is matched to 
column j of P*. Let L* =(L3) be an n Xn matrix where L¥ = {t| (i, j) entry of 
Pf is 1, 1<t<k} for all 1 <i, j<n. Then L* =(Lj) is ann Xn generalized latin 
square on {1, 2,...,k} since each P* is ann X n (0, 1)-matrix with exactly one 1 
in each row and each column. Furthermore, it is clear that (i,j) entry of L* 
equals (i, /) entry of L for all 1<i<m, 1<j<n. Hence L* is an extension of 
Le AJ 


Corollary 2.4. Every mxXn (mn) latin rectangle L=(L,) on k symbols 
1,2,...,k can be extended to form an n Xn generalized latin square L* = (Lj) 
on the same set {1,2,...,k}. 


Proof. Since every latin rectangle L =(L,;) is a generalized latin rectangle with 
each L;, 1<i<m, 1<j <n, being a singleton set, the result then follows from 
Theorem 2.3. 0 


Theorem 2.5. Every r Xs (r<s) generalized partial latin rectangle Q =(Q,;) on 
A=({1,2,...,k} can be extended to an n Xn generalized latin square L* = (L;;) 
on A if and only if 


n=r+s—min{N(i)|ieA} 


where N(i) denotes the number of times that symbol i appears in Q. 
Proof. 


Necessity. Let Q =(Q,;) be an r Xs (r <s) generalized partial latin rectangle on 
A=({1,2,...,k}. Suppose that Q=(Q,) can be extended to an nxn 
generalized latin square L* = (Lj) on A, which is subdivided as shown on top of 
page 74. 


It is required to prove that n =r +s — min{N(i) | i ¢ A}. Suppose on the contrary 
that there exists i¢ A such that n<r+s-—N(i). This implies that r —N(i)> 
n-S. 

Since r — N(i) is the number of rows that symbol i does not appear in Q and 
L=Q+X isa generalized latin rectangle, X must have at least r — N(i) columns. 
But the number of columns in X is n—s and n—s<r-—WN(é), a contradiction. 
Hence n =r +5 — min{N(i) | A}. 
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Sufficiency. Let Q = (Q,;) be an r Xs (r <s) generalized partial latin rectangle on 
A. Let ny =r+s—min{M(i)|ie A} andn2=n,. 


Case 1: n=n, 
Let M, =A-\ {Qj |j=1,2,...,5} for all 1<i<r. Let L=(L,) be anr Xn 
matrix defined as follows: 
(i) For all 1 <i<r, 1sj<s, Lj, =Q, 
(ii) Ligsaty = Mi, Lig =O for alls +2<q<n 
(iii) For all 2<i<r, 5+1<j<n, te L,@teM, and j is the least integer in 
{s+1,...,m} such that 


t¢U{L,|p=1,2,...,i-1}. 


Then L =(L,) is an r Xn generalized latin rectangle on A. By Theorem 2.3, L 
can be extended to form an n X n generalized latin square L* = (Lj) on A. 


Case 2: n>n, 

By Case 1, we can extend Q to an n, X n, generalized latin square L* on A. Let 
B be an (n—7,) X (n—17,) generalized latin square with each diagonal entry 
equal to A and empty elsewhere. 
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is the required n X n generalized latin square on A. O 


Theorem 2.5 can be reformulated as follows: 
Theorem 2.6 (Hilton [4], Theorem 2). Every r Xs (r <5) generalized partial latin 
rectangle Q =(Q,) on A={1,2,...,k} can be extended to ann Xn generalized 


latin square L* = (Lj) on A if and only if 
N(i)2=rt+s—n  forallieA 


where N(i) denotes the number of times that symbol i appears in Q. 


3. Example 


Consider the 2X3 generalized partial latin rectangle Q=(Q,;) on A= 
{1, 2, 3, 4, 5} as shown below: 


We shall illustrate how Q =(Q,,) can be extended to a 5X5 generalized latin 
square L* = (Lj) on A by two different approaches, namely 

(a) the approach used in the proof of Theorem 2.5 

(b) the approach used by Hilton [4] based on de Werra’s theorem [6]. 


Approach (a). In what follows, we shall use the same notations as in the proof of 


16 C.-L. Deng, C.-K. Lim 
Theorem 2.5. 


M, = {5} 

M2 = {2,5} 

L=(L,) isa2 x5 matrix with 
Ly=Q,; forall<i<2, 1<j<3 
Ly4= {5} 
Ly = {2} 


and 
L,,= (5) 


Diagram of L 


Hence, L is a 2 X5 generalized latin rectangle on A. We then form a set of 5 x 5 


(0, 1)-matrices P{,..., P$ as obtained in the proof of Theorem 2.3. 
10 0 0 0 01 0 00 00 1 00 
01 0 00 00 0 10 10 0 0 0 
00 1 0 0 10 000 0 1 0 0 0 
00 0 1 0 00 100 00 010 
00 0 01 00 001 00 001 

PY P3 P3 
00 1 0 0 000 10 
01 0 0 0 00 001 
10 0 00 10 0 00 
00 0 1 0 01 0 00 
00 0 01 00 1 00 
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Then 


is a 5 X 5 generalized latin square on A. 


Approach (b). In the proof of Theorem 2.6, Hilton [4] made use of the following 
theorem: 


Theorem (de Werra) [6]. For each k = 1, any finite bipartite graph has a balanced 
edge-coloring with k colors. 


Step 1: Extend Q to a2%S5 generalized latin rectangle L on A. 

Form a bipartite graph G(X, Y) as follows: 

The vertices in X correspond to the rows of Q, say X = {R,, R2}. The vertices 
in Y correspond to the symbols in A, say Y= {1, 2,3, 4, 5}. R; (1<iS2) is 
joined to y (1 <y <5) in Y if and only if y does not occur in row i of Q. 

Since the degree of each vertex in Y is less than or equal to two, by de Werra’s 
theorem, G has an equitable edge-coloring with two colors, say color 4 and color 
5 (for simplicity we use colors 4 and 5 to indicate the new columns 4 and 5). Place 
symbol y into entry (i, j) of L if and only if there is an edge (R;, y) with color j. 
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Diagram of G(X, Y) 


We thus obtain the following generalized latin rectangle L: 


Diagram of L 


Step 2: Extend L to a5 X5 generalized latin square L* on A. 

Form a bipartite graph G*(X*, Y*) where vertices of X* correspond to 
columns of L and vertices of Y* correspond to symbols in A, say X*= 
{C1, C2, €3, C4, Cs} and Y* = {1, 2, 3, 4, 5}. c, € X* is joined to y € Y* if and only 
if y does not occur in column i of L. 

As in Step 1, we can give G* an equitable edge-coloring with 3 colors, namely 
colors 3, 4 and 5. 
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Diagram of G*(X*, Y*) 


With this coloring, we obtain L* as follows: 


Diagram of L* 


79 
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Remark. As illustrated from the above example, it is clear that our approach in 
extending a generalized partial latin rectangle to a generalized latin square is 
much simpler than the one given by Hilton [4], especially when the corresponding 
bipartite graph has large maximum degree. 
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I wrote many papers with this and similar titles. In my lecture I stated several 
of my old solved and unsolved problems some of which have already been 
published elsewhere. To avoid overlap as much as possible, I state here only 
relatively new problems. 

First I state two recent problems of Nesetril and myself. 

1. Let G be a graph each vertex of which has degree not exceeding n. It is true 
that if our G has more than 5n?/4 edges then G contains two strongly 
independent edges? (i.e. two edges, which are vertex disjoint and for which the 
subgraph of G induced by the vertices of these two edges contains only these two 
edges). 

Very recently Fan Chung and Trotter and independently and simultaneously 
Gyarfas and Tuza proved this conjecture. The proofs are quite complicated. It is 
easy to see that the result is best possible. 

We also formulated the following much more difficult and interesting Vizing 
type conjecture: Let G be a graph each vertex of which has degree not exceeding 
n. Is it then true that G is the union of at most 57/4 sets of strongly independent 
edges? If this conjecture fails one can try to determine the smallest f(n) so that 
every G each vertex of which has degree <n is the union of f(n) sets of strongly 
independent edges. f (n) <2n? is easy. 

One could perhaps try to determine the smallest integer h,(n) so that every G 
of h,(n) edges each vertex of which has degree <n contains two edges so that the 
shortest path joining these edges has length =r. The order of magnitude of h,(n) 
is easily seen to be n’*! but the exact value of A,(n) is unknown. This problem 
seems to be interesting Only if there is a nice expression for h,(n). 

Our second problem states as follows: Let G(n) be a graph of n vertices 
X,...,X,. A subset x;,,..., xX, iS said to be a minimal cut if the omission of 
these vertices disconnects G(n), but no subset x,,,...,%;, disconnects G(n). 
Denote by c(n) the maximal number of minimal cuts a G(n) can have. Seymour 
observed c(3m + 2)2 3”. To see this let G(3m + 2) have the vertices x, y and 
there be m independent paths of length 4 joining x and y. Perhaps c(3m + 2) = 
3”, we could not even prove that c(n)'"—> a <2. 
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2. Gallai conjectured more than a year ago that if G(n) is a graph which 
contains no wheel (i.e. a cycle and a vertex joined to all the vertices of our cycle) 
then G(n) contains at most 4n? triangles. It is easy to see that this conjecture if 
true is best possible. Let |A| = 4n, |B| = [4(n + 1)], join every vertex of A to every 
vertex of B and add in B a matching. It is well known and not hard to see that 
every graph of 4n?+4n +1 edges contains a wheel (i.e. our graph is the largest 
graph which has no wheel). Unfortunately this does not seem to help with 
Gallai’s conjecture. 


3. Recently Gallai and I posed the following problem: Denote by h(n) the 
smallest integer so that every G(n) has a set of <h(n) vertices x1,...,%,, for 
which every clique of G() contains at least one of these x;,’s. It is easy to see that 
h(n) <n—Vn. We conjecture that h(n) equals to the smallest integer for which 
every graph of n vertices which has no triangles has a set of at least n —h(n) 
independent vertices. To convince the reader that our conjecture is not 
unreasonable consider the set of all graphs on 7 vertices which have no triangle. 
Such a graph must have at least n ~— h(n) independent vertices and there is such a 
graph G,(n) which has no triangle and which has exactly n — h(n) independent 
vertices. Thus to represent all cliques (i.e. in this case all edges) of our graph we 
need h(n) vertices (namely the complement of our largest independent set). Thus 
it was perhaps not unreasonble to assume that h(n) vertices will always suffice to 
represent all cliques of our graph. We made no progress with this conjecture 
which is perhaps completely wrongheaded. We could not make any progress even 
if we assumed that our G(n) has no K(4). In this case we only would have to 
represent all K(3)’s of G(n) and all the edges not contained in a K(3). 

Gallai further conjectured that if G(n) is a chordal graph (i.e. all cycles C,, 
n>3 have a diagonal) then all cliques can be represented by [4n] vertices. This 
conjecture was indeed proved by Aigner, Andreae and Tuza. 


4. The problem of Gallai and myself naturally leads to the problem of Ramsey 
numbers. Many papers on these questions were published and to avoid repetition 
I state here only a few of them and will give an admittedly incomplete list of 
references. 

Denote by r(u, v) the smallest integer so that every graph on r(u, v) vertices 
either contains a complete graph of u vertices or an independent set of v vertices. 
It is more usual to use the following (equivalent) definition: r(u, v) is the smallest 
integer so that if we color the edges of K(r(u, v)) (i.e. the complete graph of 
r(u, v) vertices) by two colors I and II then there is either a K(u) all whose edges 
have color I or a K(v) all whose edges have color II. r(n,n) is the diagonal 
Ramsey number, it is the smallest integer for which the every complete graph of 
r(n,n) vertices whose edges are coloured by two colors always contains a 
monochromatic K(n). r(3, 3) =6, r(4, 4) = 18 (this is an old result of Greenwood 
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and Gleason) and r(5, 5) is unknown. The best current bounds are 


cn2” <r(n,n)< (°") [dog ny’. (1) 


I proved the lower bound in (1) by probabilistic methods. The value of the 
constant was improved by Joel Spencer. The upper bound in (1) was recently 
obtained by Rédi and is not yet published. I offer 100 dollars for a proof that 

lim r(n, n)'"=c (2) 
exists and I offer 10 000 dollars for a disproof. I am of course sure that (2) holds. I 
offer 250 dollars for the determination of c. V2 <c <4 follows from (1), perhaps 
c=2? Let us now give a very short discussion of the non-diagonal Ramsey 
numbers. We have 


Cyn 


<r(3,n)< con" (3) 
r(3, F 
(log n)? logn 

The lower bound in (3) is due to me, the upper bound is due to Ajtai, Komlés 
and Szemerédi, who improved by a factor log log n the previous result of Graver 
and Yackel. It is perhaps not hopeless to try to get an asymptotic formula for 
r(3, n). 

It would be reasonable to guess that for every fixed k and e>0 if n >n,(e, k) 


r(k,n)>nk-t* (4) 


but the proof of (4) presented so far unsurmountable difficulties, even for k = 4. 
At first I thought that the difficulties are only technical and the probability 
method will give (4), but perhaps I was too optimistic. 

The best constructive lower bound for r(n,n) is due to Peter Frankl, who 
proved 


c(log ay : 


ne expe log n 


I offer 100 dollars for a constructive proof of r(n, n)>(1+c)". I am afraid that 
there are easier methods of earning 100 dollars. 

Several of us tried to prove simple inequalities between Ramsey numbers. We 
all failed so far. The main difficulty is perhaps the lack of constructive methods. 
Here is a sample which shows our ignorance: 

Is it true that 


r(nt+1,n)—r(n, n)>cn?. (5) 
‘Clearly’ (?). 
limr(n +1, n)/r(n,n)=C} where r(n,n)'">C. (6) 
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(6) seems quite hopeless at present. V.T. Sdés and I failed to prove 


r3,2+1)—7r(3,n) 


; —0 and r(3,2+1)-r(3,n)>. (7) 


The second inequality in (7) should be perhaps easier than the first. Simonovits 
and I tried unsuccessfully to prove that for every k >4 


lim r(k +1, n)/r(k, n)=% (8) 


(4) is easy for k = 3. 

I just mention one of two problems on generalized Ramsey numbers. r(n; $) is 
the smallest integer ¢, so that if we color the edges of K(t) by two colors then 
either color [ contains a K(n) or color II contains G. It is particularly frustrating 
that (C, is a cycle of length four) 


lim r(n, K(3))/r(n, C4) =” (9) 


has not yet been proved. (9) is an old conjecture of mine. I in fact conjectured 
that the following much sharper (and much more doubtful) result holds: 


r(n;C4)<n?-* forsomee>0O and n>n)(e). 
Szemerédi proved (unpublished) 
r(n; C4) <cn?/(logn)’. (10) 


(10) in view of (3) ‘nearly’ proves (9). To end this chapter I state an old and 
nearly forgotten conjecture of Bondy and myself: Let n be odd. Color the edge of 
a K(4n —3) by three colors. Then there always is a monochromatic C,. The 
analogous conjecture for two colors was proved by V. Rosta and Faudree and 
Schelp. 

Several excellent survey papers on Ramsey numbers were written by Burr and 
Rosta, see also a forthcoming book on this subject by Burr, Faudree and Schelp. 
Faudree, Rousseau, Schelp and J have many papers on this subject. 
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5. During my last visit to Memphis State University Faudreee, Rousseau, 
Schelp and I came across the following nice problem which to our surprise is 
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perhaps difficult. (The problem came up in connection of our work on generalised 
Ramsey theory, but as such is quite independent of it.) Is it true that there is an 
absolute constant c>0 so that every G(n;2n—1) has a subgraph G(m), 
m<n(1—c) so that every vertex of G(m) has degree =3? Faudree could prove 
this with m <n — cVn instead of n(1 —c). C,_1 and an nth vertex joined to every 
vertex of our C,,_, shows that 2n — 1 cannot be replaced by 2n — 2. 

Pésa and I proved that every G(n; n +k) contains a cycle of size not exceeding 
c\nlogk/k and apart from the value of c, this is best possible. It might be of 
some interest to try to obtain the exact size n - g(k) of this cycle for small values 
of k, for example g(1) = 3, g(5) =4. I believe we determined g(k) for k <5, for 
larger values of k the exact determination of g(k) gets more and more laborious 
and tricky. A cycle can of course be considered as a subgraph of degree 2, but 
perhaps our old result with Pésa throws no light on our conjecture, since it is not 
difficult to prove that for every c, there is a c, so that there is a @n;c,n) for 
which every subgraph Gm) each vertex of which has degree =3 has more than 
Con vertices. The existence of a such a graph follows easily by the probability 
method, but a direct construction will perhaps also be easy. We have not 
determined the exact dependence of cz on ¢,. 
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6. Several years ago Sauer and I asked the following question: Let f,(n) be the 
smallest integer for which every G(n; f,(n)) contains a regular subgraph of degree 
k. Trivially £,(n) =n, but we could not get any non-trivial results for f,(m) for 
k =3. In particular we could not prove f,(n)/n— and fi(n)<n't*. A few 
months ago Pyber proved 


fin) <c2k?n logn (1) 
Pyber, Rédi and Szemerédi proved 
cn log logn <f,(n). (1') 


Their proof of both the upper and lower bound of (1) is ingenious. It would be 
nice to improve (1) further and get an asymptotic formula for f,(n) and generally, 
f(r). 

Szemerédi once asked: Denote by F,(n) the smallest integer so that every 
n; F,(n)) contains an induced subgraph of degree k. How large is F,(n)? Again 
it is trivial that K(n) =n. I observed that F(n) <cn3 since it is easy to see that 
every Gn; cn}) contains either a K(4) or an induced K(3, 3). It would be nice to 
improve this if possible. 
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7. Now I discuss some problems on extremal graph theory. Let G be any 
graph, denote by 7(n;G) the Turan number of G, i.e. the smallest integer so 
that every &n; T(n; G)) contains G as a subgraph. Many papers on this subject 
have been published recently (some by myself). Bollobas published an excellent 
book on this subject and Simonovits an excellent survey paper; thus to avoid 
repetitions I will try to mention as much as possible only new problems or 
questions which have been neglected. 

In a paper Simonovits and I investigated the following question: Let 
Gn; T(n; G) + €) be a graph. How many copies of G must our graph contain? 
For large € we get very satisfactory results but we had little success for small 
values of ¢. In particular for which graphs G it is true that every G(n; T(n; 9)) 
contains two (or more generally) many copies of G. If @ is a triangle then 
Rademacher, almost immediately after he heard of the result of Turdn 
(T(n; K(3)) =[4n?] + 1), proved that every G(n; [4n?] +1) contains at least [47] 
triangles and this result is best possible. This result was extended to G(n; [4n?] + 
€) first for small values of g by me and later to a much larger range by Bollobas, 
Lovdsz and Simonovits. On the other hand Simonovits and I conjectured that 
G(n; T(n; C,)) contains c,n? C,’s. If true this result is best possible, but we could 
not even prove that it contains 2 C,’s. Can one characterise those graphs G for 
which every G,(n;T(n;G)) must necessarily contain at least two subgraphs 
isomorphic to G? 

Here I would like to insert one problem on hypergraphs which perhaps will 
lead to interesting problems. I only state the simplest case. A classical problem of 
Turan states: Let T(n; K®(4)) be the smallest integer so that every triple 
system on n elements and T®(n; K®(4)) triples contains a K®(4), i.e. a set of 4 
elements all whose triples are in our system. The determination of T®(n; K®(4)) 
seems to be very difficult. Is it true that such a triple system must contain at least 
two (and perhaps in fact cn) K®(4)’s. Observe that it is easy to see that every 
G(n;[4n?] + 1) contains an edge e and c,n other vertices X,,..., X, So that all 
these vertices form a triangle with e in our G(n; [4n?] + 1). Bollobds and I 
conjectured and Edwards proved that c, =} is best possible. Is it true that every 
3-uniform hypergraph (or triple system) G°(n; T@(n; K©(4)) contains an edge 
e and c,n vertices x,,... , x, So that e and x; are a K)(4) in our hypergraph? This 
problem is of course open even for ¢ = 2. It follows easily from our results with 
Simonovits that a G(n; (1 + €)T(n; K(4))) contains such a system. 
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8. Now I return to ordinary graphs, i.e. (r=2). The results of Stone, 
Simonovits and, myself show that the most interesting open problems are if G is 
bipartite. I first state some of our favourite conjectures with Simonovits. 

Is it true that for every bipartite G there is a rational a, = a(G,) <a <2 for 
which 


lim T(n;G)/n*=c(G), O<c(G)<@ (1) 


exists? Is it true that for every rational a, 1<a@<2 there is a bipartite G for 
which the limit (1) exists? At present these conjectures which are about 20 years 
old are beyond our reach, and in fact we have no real evidence for their truth. 
We have no idea of the possible value of c(G), it would perhaps be reasonable to 
assume that c(G) is algebraic. 

Let G be bipartite. Denote by K(G) =r the largest integer for which G has an 
induced subgraph G' each vertex of which has degree =r. We conjectured that 
then 


1 1 
2-—— < a(G)<2--. (2) 


Thus in particular if r =2, i.e. if G has no induced subgraph each vertex of 
which has degree >2, then 


T(n; G) < cn}. (3) 


On the other hand if G has an induced subgraph each vertex of which has 
degree =3 then our conjecture would imply a(G) > 3. There is some (admittedly 
inadequate) evidence for our conjectures. Let G be the graph defined by the 
edges of the three-dimensional cube. We proved 


T(n; G)<cn! (4) 
and we believe a(G) = §, but unfortunately we could not even prove a(9) > 3. In 
fact even T(n; G)/n}—> & is open. We proved that T(n; G —e) < cn}. Further we 
have 

T(n; K(r, r)-—e) <n?” 


A nice test case of our conjecture is the following: Let G, be a graph of 
1+4+(5) vertices and ¢ + t(t — 1) edges defined as follows: 

The vertices of G, are x; y1, Yo,---5 Ye Z1y-++» 24) X iS joined to all the y’s 
and each z is joined to two of the y’s so that every pair (y,, y;) is joined to exactly 
one z. Our conjecture (3) would imply 


T(n; G,) < cn}. (5) 
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For 1 = 3 (5) is easy and we have no proof for :>4. Omit X from G, then we 
obtain G;. Gj is C, and we know a(G3)~4. Faudree and Simonovits proved 
a($)<3. Perhaps for every ¢ a(,)<3. Unfortunately their ingenious proof 
only works for t= 4. 

Let now G,;, 1<i<r be a family of graphs. Define T(n;G,..., G) as the 


smallest integer for which every Gn; T,,(%, ..., G)) contains one of the G,’s as 
a subgraph. Simonovits and I asked if there is a system G,,..., G, for which 
lim T(n;G,,...,G,)/ min T,,(G,) =0? (6) 
ax f=1,2,..., r 
Perhaps our conjecture (1) can be generalised and there is an a(%,,..., G) for 
which 
lim T(n; G,..., G)/n*oD=e, O<c<o (7) 


Faudree and Simonovits believe that 
a(C,4, 9) < min(a(C,), a(G)) (8) 


and they hope that their method will give (8). 

To end this long chapter I make a few remarks on some questions which 
certainly have not been investigated carefully. Let ¢, be the largest integer for 
which there is a G(k; €,) satisfying 


lim T(n; Gk; &)/ni <<. (9) 


K(2; k — 2) shows that ¢, = 2k — 4 and our conjecture (3) easily gives ¢, = 2k —4. 
Perhaps this can be proved without (3) but by the probability method I could only 
prove that T(n;G(k; 2k —3))>cn?. Perhaps the following problem is more 
interesting: What is the largest integer ¢, for which there is a G(k; t,) satisfying 


T(n, Kk; ty) =0(n?)? (10) 


If (10) holds then “k; f) can of course not contain a C,. It would be interesting 
to determine other forbidden bipartite graphs whose presence prevents (10) from 
holding. I have no good guess about the size of t,, perhaps 


2k — Ck} <% <2k — ck. (11) 
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9. Now I give a very short discussion of extremal problems on hypergraphs. To 
make the paper short I state only a few new problems. A long paper of mine is in 
the press on similarities and differences of extremal problems between graphs and 
hypergraphs and Frankl and Fiiredi have a long forthcoming paper in the J. 
Combin. Theory (A) on this subject. 

Let G® be two triangles with a common edge and G§$) be the following 
G®(6;3) having the vertices x,, x2, X3, X4, Xs, Xs and the edges 
{x1, X2, X3}{X3, X4, Xs} {X2, X4, Xo}. An old problem of W. Brown, V.T. Sés and 
myself asked for the determination or estimation of T(n; GY, G§). Ruzsa and 
Szemerédi proved that 


T®(n; GP, GP) /n? 30 (1) 


but for every e>0 T(n; GY, GY)/n?-*—>&. In fact they prove a sharper 
result. An asymptotic formula for T®(n; G®, GY) seems hopeless at present. 
(1) is certainly a new phenomenon. I then asked if there exists for some ra G 
so that there is an @ for which 


T(n; G)/n*>0 


but for every e >0 T(n; G)/n*-£—> ©. Frankl and Fiiredi found such a G® for 
r=5, w=4. It is not yet known if for r<5 such a hypergraph exists. 

Frankl gave a talk on hypergraphs at our meeting in Hakone. After his lecture 
I asked: Is it true that every G(n; 1+ ("3')) contains our G§” and that the only 
Gn; ("7')) which does not contain our G$? consists of the ("3 ') triples which 
have a common vertex? Frank! informed me that this has already been proved by 
him and Fiiredi but that the proof is not quite simple. I at first though that if 
every vertex is contained in only o(n7) triples then every such G®(n; en”) will 
contain our G$. This was easily disproved by Frank! but perhaps such a G°(n) 
can only have 4n7(1 + 0(1)) edges. I then asked: Is it true that if every pair of 
vertices (x, y) is contained in only <Cni triples then every G®(n; en”) must 
contain our G$? During our excursion Fiiredi found the following nice 
counterexample: : 

Let n =p?+p+1. We will have 2n elements A and L where A corresponds to 
the points and L to the lines of a finite geometry of n points. We divide A into 
two disjoint sets B UC, both having (3 + 0(1))p? elements and both meet every 
line of our finite geometry in (4 + 0(1))4n? points, Fiiredi’s system now consists of 
the (1+ 0(1))4n? triples (x, y, €) where x € B, ye C and @e L where x and y are 
on the line @. 
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Whereupon I modified my conjecture: 

Assume that every (x, y) is contained in only o(n?) triples of our system. Then 
if such a triple system has cn? edges must it contain a G$?? It seems to speak 
against this conjecture that it was born as a response to several counterexamples. 
To end this section I would like to state an old problem of mine which seems very 
difficult: Is it true that for every k and ¢ >0 there is an mg so that for every n > ng 
every Gn; en?) contains either a GY? or a GO(k; k +3)? For k =3 this was 
our problem with Brown and V.T. Sés which was settled by Ruzsa and Szemerédi 
but for k >3 very serious new difficulties appear and the problem is still very 
much open. 

By the way Frank! proved a result on hypergraphs which is related to 
conjecture (1) of the previous chapter. 
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10. To end the paper I state a few miscellaneous problems. First of all here is a 
very nice problem of Tuza. Let 9 be a graph and k the largest integer for which G 
has k edge disjoint triangles. Is it then true that G can be made triangle free by 
the omission of at most 2k edges? K(4) and K(5) shows that if true the result is 
best possible. If true many generalisations and extensions will be possible. 

Rothschild and I posed a few years ago the following problem: Assume that 
every edge of a &n; cn’) is contained in a triangle. Denote by h(n; c) the largest 
integer so that every such graph has an edge which is contained in at least h(n; c) 
triangles. The determination or good estimation of h(n; c) does not seem to be 
quite easy. Szemerédi observed that his Regularity Lemma easily gives that for 
every c>0 

lim h(n; c) = 
and Noga Alon proved that for small c, h(n; c)< c'n}. It is easy to see and well 
known that for c>4 one has A(n;c)>c,n,, but without much difficulty the 
following stronger result can be proved. 

Let e>4n?—cn and assume that every edge of G(n;e) is contained in a 
triangle. Then there is an absolute constant c,; =c,(c) for which our {n; e) has 
an edge which is contained in =c,n triangles. That the result is best possible is a 
slight modification of Noga Alon’s proof that if f(n)—© then there is a 
G(n; (in? — nf(n))) each edge of which is in a triangle but every edge is only in 
o(n) triangles. 

We give an outline of the proof. Let G(n; 4n?— cn) be a graph each edge of 
which is contained in a triangle. Observe first that if (x,, x2, x3) is a triangle of 
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our G, then we can assume that 
u(x,) + u(x.) + u(x3) < n(1 + €) (1) 


For if (1) would not hold then one of the edges (x1, x2), (x1, X3), (x2, X3) were 
contained in at least 4(en) triangles and thus our theorem is proved in this case. 
Henceforth we can assume that (1) holds for every triangle of our graph. 

Assume next that our G(n; 4n?— cn) contains more than 10c vertex disjoint 
triangles (x{?, x$?, x$). Omit all these vertices from our G. Then by (1) we 
obtain a G(n — 30c) which has more that 4(n — 30c)? edges and therefore by an 
old and elementary result of mine it contains an edge which is contained in c‘n 
triangles, by the result of Edwards c’ = } + o(1). 

Thus we can assume that (1) holds and our G has at most 10c vertex disjoint 
triangles. But then by a simple argument at least one vertex is contained in 
n*/130c triangles and therefore at least one edge is contained in at least n/130c 
triangles, which proves the first part of our theorem. 

The proof of the second half of our assertion is very simple. Let |G| =n, 
|A| = & where @, tends to infinity as slowly as we please. |B] =|C| =4(n — @). 
Join every vertex of B to every vertex of C. Divide B and C into @, roughly equal 
disjoint sets B; and C,. Join x;;€A to every vertex of B, and C;,. If €, tends to 
infinity sufficiently slowly our graph has 3(n?) — nf(n) edges and each edge is on 
o(n) triangles as stated. 

It would perhaps be of interest to improve the estimates for h(m;c) and 
investigate what happens if c =c, > 0. 

Pyber and I considered the following related problem. Assume again that every 
edge of a G(n;e) is contained in at least one triangle. Denote by L(n;e) the 
largest integer so that our graph contains at least L(n; e) triangles. Trivially for 
all e L(n;e)2e/3, and the result of Ruzsa and Szemerédi shows that for 
e <cnr;(n) e/3 is exact. On the other hand it is well known and easy to see that if 
e>(1+.e)in? then L(n;e)>c,n* even if we do not assume that every edge is 
contained in a triangle. We thought that perhaps for e > cn” L(n, e) > (4+ 0(1))e. 
The complete bipartite graph with a matching shows that if true this is best 
possible’. It would perhaps be of interest to investigate what happens to L(n; e) if 
e/n’— 0 very slowly. 

Last year Stephan Burr and I came.across the following problem: Let f() be 
the smallest integer for which if we color the edges of K(f(n)) (i.e. a complete 
graph of f(n) vertices) by two colors then there either are two monochromatic 
K(n — 1)’s with a common vertex where the two K(n — 1)’s have different colors 
or there is a monochromatic K(n). Is it true that f(m) =r(n; n — 1)? This is open 
even for n=S. 

The following simple problem of Renu Laskar and myself seems still to be 


1p. Frankl has just proved this conjecture. 
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open. Let g(n) be the largest integer for which every &n; [4n?]+ 1) contains a 
triangle x,, x2, x3 for which the sum of the degrees of the vertices x,, x2, x3 1s 
2g(n). We proved 


(1t+c)n<g(n) <(1 + 0(1))2(V3— 1)n. 


The upper bound is probably best possible. Clearly many generalisations and 
extensions are possible. 
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Several new tools are presented for determining the number of cliques needed to 
(edge-)partition a graph. For a graph on n vertices, the clique partition number can grow cn* 
times as fast as the clique covering number, where c is at least 1/64. If in a clique on n vertices, 
the edges between cn° vertices are deleted, 4<a@<1, then the number of cliques needed to 
partition what is left is asymptotic to c?n”; this fills in a gap between results of Wallis for a < 4 
and Pullman and Donald for a= 1, c >}. Clique coverings of a clique minus a matching are 
also investigated. 


1. Introduction 


Only undirected graphs without loops or multiple edges are considered. The 
graph K,, on n vertices for which every pair of distinct vertices induces an edge is 
called a complete graph or the clique on n vertices. If G is any graph, we call a 
complete subgraph of G a clique of G (we do not require that it be a maximal 
complete subgraph). A clique covering of G is a set of cliques of G which 
together contain each edge of G at least once; if each edge is covered exactly 
once we call it a clique partition. The clique covering number cc(G) of G is the 
smallest cardinality of any clique covering; the clique partition number cp(G) is 
the smallest cardinality of a clique partition. 

The question of calculating these numbers was raised in 1977 by Orlin [6]. 
Already in 1948 deBruijn and Erdés [2] had proved that partitioning K,, into 
smaller cliques requires at least n cliques. Some more recent calculations 
motivating this study include [8] by Wallis in 1982, where it is shown that if G has 
o(Vn) vertices, then cp(K, — G) is asymptotically equal to n; [7] by Pullman and 
Donald in 1981, where cp(K,, — K,,) is calculated exactly for m = 4n; and in [1] by 
Cacceta et al. in 1985, where it is found that at its largest cp(G)—cc(G) is 
asymptotic to 4n?, where G has n vertices. 

Several questions left open in these earlier papers are explored. We obtain 
asymptotic results for cp(K, — Km) for m in the range Vn <m<n, connecting 
the results of Wallis 1982 [8] and Pullman 1981 [7]; for example if m=cn’, 
4 <a<1, then cp(K, — K,,) is asymptotic to c’?n™. We apply bounds developed 
in this connection to bound the maximum value of cp(G)/cc(G) on graphs G with 
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n vertices, showing it can grow as fast as cn? where c > %. We also provide simple 
proofs of some bounds on cc(T7,,) where T,, is K, minus a matching. 


2. Lower bound techniques 


Let G=G,, be a graph with n vertices, with these vertices divided into two 
sets A and B with a and b elements, a + b =n. The edges of G now fall into three 
classes which we call “‘A edges’, ‘“‘B edges’, and “‘connecting edges” depending 
as their endpoints lie both in A, both in B, or one in each. Suppose a clique in G 
contains more than one of the connecting edges of G; then it must contain some 
A edges or B edges or both. If the number of connecting edges in G is large, 
there will not be enough A edges or B edges of G available to combine the 
connecting edges into just a few cliques. This technique is used in Theorem 3 of 
[7], which says (here C + D is the graph that has vertex disjoint copies of graphs 
C and D and all edges between vertices of C and vertices of D): 


Theorem. Let H be a graph with p vertices and m edges. Let q be at least the edge 
chromatic number of H. Then cp(H +K,)=pq—m and any minimal clique 
partition has edges and triangles only. 


The proof depends on the fact that there are pq edges connecting H to K,; two 
lie in the same clique only if that clique contains at least one edge selected from 
the m edges of H, since K, has no edges. 

A similar strategy is used in [1] to produce a sequence of graphs G, with n 
vertices for which cp(G,,) — cc(G,) is asymptotic to 4n?. 

Our goal in this section is to give several lemmas that consider cases where the 
cliques use more than one connecting edge. We are able to extend several 
existing lower bounds on clique partition numbers by this strategy. 

We begin with a purely numerical lemma. 


Lemma 1. Let 07_,e; =c and 07.,e? <d. Then q 2c?/d. 


Proof. Substituting the g equal values c/q for the (possibly distinct) e; preserves 
the sum of the e; and can only reduce the sum of the e7. thus Y#_, (c/q)’<d so 
q(c/q)’ <4, and the result follows. O 


Lemma 2. Suppose the graph G has k edges in side A, no edges in side B, and c 
connecting edges. Then 


2 


Cc 
cp(G) * (2k +0) : 


Proof. If G is partitioned by q cliques and clique i has e; connecting edges, then 
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clique i has e,(e; — 1)/2 edges in side A. Then ¥}7_,e,;=c, and 


kD efe,-2=(¥ e-¥ «)/2=(S fc) /2 


and the result follows from Lemma l. © 


We thus obtain a lower bound for the clique partition number of a clique minus 
a clique: 


Theorem 1. From n=m21, and n#1, 


(n-—m)m? 
kK, - = —__— 
cp( n Kn) (n = 1) 
Proof. There are n —m points and 4(n —m)(n — m —1) edges in “side A” and 
m(n — m) connecting edges; Lemma 2 applies. O 


Corollary 1. If 0<c<1, then 
cp(K,, = Ken) 2 (n a en)(cn)’/(n = 1) os (1 = c)e*n?. 


Here f ~g means that as n>, f/g— 1. 

In [7] there is a corollary of Theorem 3 which gives an exact formula: 
cp(K, — Kn) =4(n —m)(3m—n+1) when n>m2i(n—e) (where e=0 for 
n~m odd, e =1 otherwise). Our result is not as good as theirs for m > 4n (for 
example, for n= 12, m=8, c=4, we get cp(G)> 23 and they get cp(G) = 26) 
but our result gives some indication of the value of cp(K,, — K-n) even if m =cn is 
a small fraction of x. 

Wallis has told us that Rose, a student of Pullman, has obtained exact results 
for m<4n; we have not seen these independent results. 

In a sense, our result fails to be tight for two reasons: 

(1) there may be cliques using no connecting edges, if m is small. 

(2) Lemma 1 uses an averaging process: in actual practice no clique can have a 
fractional number of edges, so the e; are not normally equal in a minimal 
Partition. 


Corollary 2. If 4}<a<1 and m=cn’, then for n large enough cp(K, — Kn) = 
(n — cn*)c?n*/(n — 1) =c?n™. 


This result will be discussed further once the corresponding upper bound is 
found, in Section 4. 


We now turn to results that apply if there are edges in both ‘sides’ of G. The 
first pair of lemmas are useful when the cliques involved are typically very small. 
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Lemma 3. Let a clique K, have u edges in side A, v edges in side B, and s 
connecting edges. Then 


s-~1l<utu+min(u, v). 


Proof. Suppose that A and B have a and b vertices as usual; suppose for 
concreteness that a<b. Now s =ab, u =a(a — 1)/2, and v = b(b — 1)/2. Ifa =b, 
then clearly (defining P(u, v)) 
Ptu,v)=ut+ut+min(u, v)+1—s 

= 3a(a —1)/2+1-a? 

=(a —1)(a —2)/220 
Now whenever b grows by 1, p(u,v) grows by b—a=0, so p(u, v)=0 as 
required. O 


Aggregating a number of such cliques partitioning a graph G, we obtain: 


Lemma 4, Let G have u edges in side A, v edges in side B, and s connecting 
edges. Then 


cp(G)=s —u—v—min(u, v). 


Proof. Suppose G is covered by q cliques with the number of edges in the parts 
of cliques i being u;, v,, and s; Thus Lemma 3 implies 1 25,-—u,;— vu, - 
min(u;, v;), and summing fori=1,...,q, we have 


q=> 1=355,-du,-> min(u, 7) 
i=1 i=1 i=l i=l 


q 
=s—u—vu— >, min(u;, v;) 
i=1 


25-u-vu-— min( > uj, >y v,] 
f=1 i=l 


=s—u—vu-—min(u, v). 


Example 1. Consider the graph G,, defined as follows: 4n vertices are in a clique 
A; the other 4n vertices forming set B are divided into 4 cliques each of 4n 
vertices; all the vertices of A are connected to all the vertices of B. Then, there 
are 3(4n)(4n—1) edges in side A, 2(4n)(4n —1) edges in side B, and 3n? 
connecting edges. By Lemma 4, 


cp(G,) = An *— (dn —4n)- 2(4n? — in) 
= in? + 4(3n). 
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Since cc(G,,) = 4 (each of the 4 cliques covers A and one clique of B, a total of 
4n+4n vertices), we conclude that 


cp(G,,)/ce(G,,) > an’. 


It follows that cp(G,,)/cc(G,,) can exceed cn? where c can be at least 4. 


Lemma 3 is wasteful when the number of clique covering connecting edges is 
large (it is exact only for K2, K3, and for K, and Ks; when they have exactly two 
vertices on one side). Here is another approach useful when one or both sides of 
some connecting cliques are moderately large. Lemma 5 says that goodsized 
cliques (those with at least m vertices on the larger side) use up ‘side’ edges at 
least (m — 1)/m times as fast as ‘connecting’ edges. 


Lemma 5. Let a clique K, have its vertices partitioned into sets A and B of sizes a 
and b,a+b=r. Suppose a =m. Then 


(@ - ab <2 - 1) , 50 -1) 
m 2 oo 


Proof. It is easy to check that 


Pia, 8)= ale 1), 10 ie (m= Dap 
_(1 (a—by (a+b) 
=(-)ab = 2 2 


so we need only check that P(a, b) =0. 

If a=m and b=m, (1/m)ab=max(a, b)=}(a+b) and P(a,b)=0. For 
a=m and 0<b<m-—1, P(a, b) is a decreasing function of b and zero only at 
b=m-1, so P(m, b)=0. Fixing b<m— 1 and supposing a >m, P(a, b) is an 
increasing function of a, completing the proof. O 


The use of Lemma 5 is somewhat tricky; it is included primarily because it 
allows us to cope with the following example. 


Example 2. Dom de Caen asked (question communicated to us orally by Pullman 
and by Wallis) about the clique partition number of the graph G;,, composed of 
three copies of K,, with all vertices in the second copy joined to all vertices in the 
first and third (in our notation, loosely, K, + K, + K,,). In particular, does it grow 
proportionally to n?? We can prove that it does. 

Treating the second K,, as side A and the other two as side B, A has n(n — 1)/2 
edges and B has n(n — 1) edges, with n(2n) connecting edges. Suppose there are 
q cliques in a clique partition, where the ith clique has a; vertices in side A and 6; 
vertices in side B (hence all 5; vertices lie in the first K,, or all lie in the third K,; 
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no edge connects those two cliques). Suppose the q cliques are so ordered that 
(a,, b,),..., (a,, 6,) all have a;<m and b;<m, while (4,41, 6,41), .. +s (@gs 0g) 
all have a; 2m and/or b; =m. Now for j=1,..., 7 we have 


44-1, LY 59> (M—D\a0,— (m0?) 


2 2 
while for j=r+1,...,q we have 
a,(a; — 1) 1), Os (Fa, 
2 2 m ao 


Summing over all q cliques, 


5 a) a,(a;— 1) +> ns 2 (A) 34 b= gn 


i=] 


But 
s a(a,;—1)_ n(n —1) 
fa see "ee 
and 
$= n(n -1), 
so 
3n(n - eo >(T— >)(an 2) (ee ") 
2 m q 
and 


(m— 4) Jn? 
~——— Jn 
q> Ca 1) 
The right hand side of the above inequality, considered as a function of m, has a 
maximum when m = 6 (m must be an integer). Therefore, q > 2n?/125. 


Thus de Caen’s conjecture that this graph has a fast-growing clique partition 
number is correct. However, our methods do not establish a large enough value 
of cp(G;,) to suggest that cp(G3;,,)/cc(G3,) grows as fast as in Example 1. Of 
course, in neither case have we established an exact value for cp(G); we have 
only a lower bound. 


3. Upper bounds for a clique minus a clique and cp/cc 


Here we modify a strategy used in {8] to provide an upper bound for some of 
the clique partition numbers bounded below in Section 2. 


Theorem 2. [f m=f(n) and for large enough n, Va<m<n, then cp(K, — 
K,,) < m? + o(m?’). 
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Proof. Let p be a prime power at most slightly larger than m; there are constants 
O0<c<1 and 0<b<1 so that for large enough m, there is a p with 
m<p<m+cm’. Then p?=m’+0(m’). In a projective plane of parameter p, 
delete a line of p + 1 points leaving p? points in p “parallel” lines. In one of those 
lines, delete all but m points; in the other lines, delete a total of (p?— 1) —(p— 
m) points. This leaves a total of n points, with m of them on a selected line. Use 
this design to construct a clique partition of K,—K,, into at most p?+p-—1 
cliques: each line is a clique, except the selected line of m points. There are p — 1 
other “‘parallel” lines and p” “crossing” lines. Hence 


ep(K, — Kn) <p? +p — 1<m? + 0(m?) 


as desired. O 
Corollary 1. If m= cn with }<a <1, then cp(K, — Km) is asymptotic to c?n™. 


Proof. An upper bound is given by Theorem 2 and a lower bound by Corollary 2 
of Theorem 1. O 


This result fills in most of the gap between the results of Wallis (if m<Vn, 
cp(K,— K,)=n) and of Pullman and Donald (if m=4n, cp(K,— Km) = 
4(4n)(4n + 1) ~4(4n)*). There is still a gap: our result is poorer than that of 
Pullman and Donald by a factor of two. 

More generally, we do not get as clean a result as Corollary 1 for the case 
m=cn; Corollary 1 to Theorem 1 gives a lower bound of (1—c)c?n? while 
Theorem 2 yields an upper bound of c?n?. 

We now turn to providing an upper bound for cp(G)/cc(G). We are able to do 
little here other than some delineation of the problem. We have seen in Section 2 
that if G has n vertices we can have cp(G)/cc(G) > &n. How big can it get? It is 
already known from {3] that 1<cc(G)<cp(G) <4n’. If cc(G)=1, then also 
cp(G) = 1 so we need consider only cases with 2<cc(G). Hence, we already see 
that cp(G)/cc(G) < 4n”. The following proposition improves this result slightly 
for large enough n. 


Proposition 1. If G has n vertices, and n is large enough cp(G)/cc(G) < pn’. 


Proof. If cc(G) =3, we are done. Thus we can suppose cc(G) = 2 and must show 
that cp(G) <4n?. In fact, we do better: we obtain cp(G) < in’. 


Since cc(G) = 2, G can be covered by two cliques K, and K, intersecting in a 
clique K,; a +b —c =n. Suppose for concreteness that a <b. If c< 4n, cover Ky, 
with 1 clique and partition G—K,=K,—K, with c?+0(c’) cliques. Since 
c?< 4n, for n large enough cp(G) < 4n’ as desired. 

If c>4n, then let c=4n +x for x >0. Since a<b, a—c<3(n—c)<4n—}4x, 
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and therefore c > 4a. By [7] K, — K. can be partitioned by exactly $(a — c)(3c — 
a+1) cliques. Also, there are (a —c)c edges between K, and K,_. (the clique 
with vertices in K, not in K.). We will consider two clique partitions of G: (1) a 
clique partition of K, — K, along with the clique K,, and (2) the edges between 
K, and K,_. along with the clique K, and K,_.. If either of these partitions has at 
most 4n? elements, the proof is complete. Thus, we need one of the following 
inequalities to hold. 


aye 

3 2/\3 2 n? 

ge ee 
2 8 

n x\/n ° nr? 

ao) Mae aoe 2 

(5 (5+) a (2) 


The inequality (2) is satisfied for |x — in| <2, and the inequality (1) is satisfied for 
the remaining values of x. This completes the proof of the proposition. O 


(1) 


4. A clique minus a matching 


In {6] Orlin defines 7, (not a tree) to be the graph obtained by deleting a 
perfect matching (a set of n edges, no vertex on two of them) from K>,. He asks 
about clique coverings and clique partitions of 7;,. In [5] Gregory and Pullman 
establish that cc(7>,) = log n; in [4] Gregory et al. show that cp(T7,) =n for n=8 
and that asymptotically, cp(7,) <n log log n. The last upper bound is proved by 
methods strikingly similar to those in the previous section. 

We here offer bounds on cc(7,) obtained by methods motivated by the 
heuristic discussion in {6]. These results are less precise than those in {5], but may 
be easier to visualize. 

In order to discuss clique coverings of 7,, we need some notation for the 
vertices. Suppose m= 4n,; T,, will be considered to have vertices a; and b, for 
i=1,...,m. All edges are present except the edges from a, to 6, for 
i=1,...,m. Note that no clique in a covering can contain an a; and the 
corresponding 6, but that there must be cliques containing each a,, b; pair with 
i#/j as well as ones containing each a;, a; pair and each b,, b, pair. 


Theorem 3. For all n, cc(T,,) = (log n) — 1. 


Proof. Clearly 7, cannot be covered by one clique. Given i #/, there must be a 
clique in the covering containing a; and 6,; that clique cannot also contain a,. 
Hence, for each i # j, there is a clique containing a; but not a,. But it thus follows 
easily that there are at least log(4n) cliques. (There is a clique containing a, but 
not az. Since there are 4m a;’s, this clique either includes at least 4n a,’s or 
excludes at least 4n a,’s. Choose the larger such set—the included or excluded 
a,’s—and find a clique separating two of them. Continue log(4m) times). 
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Theorem 4. For all n, cc(T,,) <= 2(log n). 


Proof. We construct an explicit clique covering. For each a; write our the 
subscript i as a binary integer of log(4n) digits (e.g. 0001, 0010, 0011, 0100,...). 
If 4n is a power of two, code the last i as 0000 to avoid the need for an extra digit. 
Clique A, will include all a; for which the kth digit of i is a 1, and all 6; for which 
the kth digit of i is 0. Clique B, has the vertices not in A,. Since if i#/j, i and j 
differ in at least one binary digit, the edge from a; to 6, is in at least one A, or B,. 
Finally, add a clique containing all the a; and a clique containing all the 6,; this 
yields a complete clique covering having at most 2(log(4n)) +2 cliques. O 
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For a positive integer n and graph B, f,(n) is the least integer m such that any graph of order 
n and minimal degree m has a copy of B. It will be show that if B is a bipartite graph with parts 
of order k and / (k </), then there exists a positive constant c, such that for any tree 7, of 
order n and for any j (0<j <(k — 1)), the Ramsey number 


r(Tq, B) <n +c (fg(n))"O-Y 


if A(T,,) <(n/(k —j — 1))— Gi + 2) «fg(n). In particular, this implies r(TJ,,, B) is bounded above 
by n + o(n) for any tree T, (since fg(n) = o(n) when B is a bipartite graph), and by n + O(1) if 
the tree 7, has no vertex of large degree. For special classes of bipartite graphs, such as even 
cycles, sharper bounds will be proved along with examples demonstrating their sharpness. 
Also, applications of this to the determination of Ramsey number for arbitrary graphs and 
trees will be discussed. 


1. Introduction 


For graphs G and H, the Ramsey number r(G, H) is the least integer N such 
that in any two-coloring (say with colors red and blue) of the edges of Ky, there 
is either a copy of G in the red subgraph or a copy of H in the blue subgraph. We 
investigate the Ramsey number r(7,, B), where T,, denotes a tree on n vertices 
and B is a bipartité graph. 

Let B be a bipartite graph with parts of order k and / (k </). Thus Bc K,;, 
the complete bipartite graph. For any positive integer n, let fg(m) be the smallest 
positive integer m such that any graph of order n and minimal degree m contains 
a copy of B. The extremal degree number f,(n) is related to the extremal number 
extg(n), which is the minimum number of edges in a graph of order n which 
insures that there is a copy of B. In fact, extg(n) =n: fg(n)/2 with the two 
expressions essentially the same for many graphs B. Therefore, f,(n) = o0(n) for 
any bipartite graph, in fact, fg(n) <c -n“~)”* for an appropriate constant c [13]. 

The main result that will be proved is the following, which gives an upper 
bound for the Ramsey number r(7,, B). 
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Theorem 1. For a fixed bipartite graph Bc K,,; (k <1) there exists a positive 
constant c such that for any j (0<j <k — 1), and any tree T,,, 


r(T,, B)<nt+c-(fa(n)¥"~, 
when A(T,) = (n/(k —j —1))- (i +2): fa(n). 


For k=2 or 3, the bounds given in Theorem 1 are of the right order of 
magnitude, and cannot be improved. In [9] it is proved that if m = A(T,), then 


r(T,, C4) = max{4, n+ 1, r(Ki.m, C4)}. 


Also, r(Ki.m, C4)<m +c'+m}, which is consistent with the degree extremal 
number for C,. This verifies the sharpness of Theorem 1 for k =/=2. For k =2 
or 3 and / arbitrary, there are similar results in {9] indicating the sharpness of 
Theorem 1. For k = 4, little is known about the extremal numbers of K,.;, So it is 
difficult to measure how accurate the results of Theorem 1 are. 

The two extreme cases of Theorem 1 (j =k —1 and j =0) give the following 
two corollaries. When j =k — 1, there is no restriction on the degree of vertices 
in 7,,. 


Corollary 2. For a fixed bipartite graph B, there is a positive constant c such that 
for all trees T,, of order n, 


r(T,, B)Sn+c-:fa(n). 


The above corollary implies that for any tree 7, and bipartite graph B, 
r(T,, B)=n+o(n). For special classes of trees, such as those with no vertices of 
large degree, r(T7,, B)=n+ O(1). This follows from the next corollary. 


Corollary 3. For a fixed bipartite graph Bc K,,; (k <I) there exists a positive 
constant c such that for any tree T,,, 


r(T7,, B)Sn+c, 
when A(T,) = (n/(k — 1)) — 2° fa(n). 


When B=Cz,, the constant c in Corollary 3 was shown to be 1 in {9]. It is 
conjectured that in fact c =k — 1 will suffice in the general case. It is, of course, 
impossible to find a better constant than this, since K,_,,,-1 contains no K,,; and 
its complement contains no connected graph of order n. 

The techniques used to prove Theorem 1 can be used to obtain sharper bounds 
for special classes of bipartite graphs such as even cycles. Corollary 2 implies that 


(Tn, Cu) Snte-n*, 


since fg(n) <a-n'* for B= C,, {2]. The next result gives an improvement of this 
bound when there are no vertices of extremely large degree. 
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Theorem 4. For any integer k =2, there exists positive constants c and d such that 
(Tn, Ca) Sn +e 


for any tree T,, of order n with A(T,)<n—d-n"*, 


2. Notation and terminology 


Notation will generally follow that used in {1]. However, some special 
conventions will be used. We describe some of the special and most often used 
terminology. 

By a two-coloring of a complete graph Ky we will always mean a coloring of 
the edges of Ky using red (R) for the first color and blue (B) for the second color. 
The red subgraph will be denoted by (R) and the blue subgraph by (B). 

By T,, we will mean a tree of order m. A path in a graph G in which all of the 
interior vertices have degree two in G is called a suspended path. An end-vertex is 
a vertex of degree 1, and an end-edge is an edge incident to an end-vertex. 
End-edges are independent if no pair of them is incident. A talon of degree m 
consists of a vertex incident to m end-edges of the graph. 

A bipartite graph B with parts of order k and / will be denoted by B,,,. Thus, 
B,,;¢ K,,;. The minimum degree and maximum degree of vertices of a graph G 
will be denoted by 6(G) and A(G) respectively. The neighborhood of a vertex uv 
of G will be denoted by N,(v), and the neighborhood of a set S of vertices 
(which is the union of the neighborhoods of the vertices of 5) will be denoted by 
N<(S). If H is a subgraph of G, then G—H is the graph obtained from G by 
deleting the vertices of H and any incident edges. 


3. Proofs 


Before proving Theorem 1 and Theorem 4, we will prove some lemmas that 
will handle special cases, and state some known results that will be helpful. A 
basis for the proof is that any large tree will have either a long suspended path, 
many independent end-edges, or a large degree talon. The first lemma deals with 
trees with long suspended paths, and the second lemma with trees with many 
independent end-edges. 


Lemma 5. For |2=k and n positive integers, let T,_, be a tree with a suspended 
path of I(k + 1) vertices and T,, the tree obtained from T,,_, by subdividing one edge 
on the suspended path. If a K,4,—, is two-colored such that T,_,<¢(R), then 
either T, < (R) or Byx &(B). 


Proof. Let X =(x,, x2,...,%m) for m=I(k +1) be the suspended path of 7,_, 
in (R) and let Y be the k vertices of K,4,—, not in the T,_. 


106 P. Erdés et al. 


Assume that 7,¢(R). Then no vertex of Y is adjacent in (R) to two 
consecutive vertices of X. Also, if x;y, x,y €(R) for y € Y, then (assuming x; and 
x; have successors along X) X;41¥,%j41¥ and X;41%)41€ (B). Therefore, if a 
vertex of Y is adjacent to k+/ vertices of X in (R), then (B) > Kys;2 Kx. 
Thus, we can assume that each vertex of Y is adjacent in (R) to at mostk +/—1 
vertices of X. This implies that at least / vertices of X are adjacent to each vertex 
of Y in (B), which completes the proof. O 


Lemma 6. For n>m=k and | positive integers, let T,, be a tree obtained from a 
tree T,-m by adding m independent end-edges. Then, r(T,, By i) 
max{r(Ty—m, By) + kl, n +k - 1}. 


Proof. Let r = max{r(Tn—m, By) + kl’, n +k — 1} and consider a two-coloring of 
the graph G=K, such that (B) $ B,,, and (R) $T,. We will show that this 
leads to a contradiction. 

Successively select vertex disjoint subgraphs H,, H2,..., H,in (B) as follows: 
H, is disjoint from H,, ..., H;-, and contains a maximal number of vertices while 
still being isomorphic to a subgraph of K,,;. Since (B) p K,.;, each vertex not in 
H, is adjacent in (R) to at least one vertex of H;. Let H be the union of these 
subgraphs. Therefore each vertex of G—H is adjacent in (R) to at least one 
vertex in each H; (1<i</). By assumption there is an embedding t of T,_,, into 
(R) such that t(7,_,,) is disjoint from H. 

Let X be the m vertices of T,,_,, incident to the m independent end-edges of T, 
not in 7,-,, and let Y be the vertices of G not in t(7,_,,). Consider the bipartite 
graph L with parts t(X) and Y induced by (R). A matching in the graph L which 
saturates t(X) would imply that (R) >7,, so assume no such matching exists. 
Therefore, by Hall’s theorem [12], there is a subset § of t(X) such that 
|N_(S)| < |S]. Since V(H)c Y, each vertex of t(X) has degree at least / and 
|S|>J. Therefore, the vertices of S are commonly adjacent in (B) to at least 
|Y|-m+12k vertices of Y. This gives a K,.; in (B), a contradiction which 
completes the proof. O 


The following lemma is used to verify that a tree without long suspended paths 
and many independent end-edges must have a large talon. 


Lemma 7 [4]. If a tree T,, does not contain any suspended path with more than s 
vertices, then the number of end-vertices of T,, is at least n/(2s). 


The next lemma is a technical result about the extremal degree number. It is 
intuitively obvious and convenient for some calculations in the proof of 
Theorem 1. 


Lemma 8. Let B=B,; and N=n+c-(fg(n)*) for a constant c>0 and 
0<a<1. Then, for large n, fg(N)<2- f(r). 
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Proof. Clearly, fg(N) <fe(n) +c - (fp(n))~ If a<1, the result follows immedi- 
ately. The same is true if B is a forest, for then fg(n) is bounded. Thus, we 
assume that a = 1, and B contains an even cycle, so fg(n) is unbounded {13}. 

Let G be a graph of order N with 6(G) =2-fg(n), H a subgraph of order n, 
and S=V(G-—H). We will assume that H  B, and show that this leads to a 
contradiction. Since H p B, 6(H) <fg(n) and there is an hy € H which is adjacent 
to at least fg(n) vertices of S. Assume that h,,..., h; have been shown, and let 
H,=H — {h,,...,h;}. Again, 6(H;,) <fg(n), so there exists an h;,, € H; adjacent 
to at least fg() — i vertices of S. For m= [f,(n)] and H’={hy,...,h,,}, each 
vertex of H’ is adjacent to m vertices of S. 

There are at least (7%) k-subsets of S in the neighborhood of each of the m 
vertices of H'. However, there are only ({) k-subsets of S. Thus, for m large, 
some k-subset is in the neighborhood of at least m - (7)/(%') 2 vertices of H’. 
This implies G > B, a contradiction which completes the proof. O 


The major difficulty in proving both Theorem 1 and Theorem 4 is dealing with 
the case of trees with large talons. The following is a greedy algorithm that will be 
used in embedding such trees. 


Algorithm. Our objective is to describe a procedure to assist in embedding a tree 
T, in (R) of a two-colored G=K,4, in which (B) $K,,, and t2l. We will 
assume that the tree 7, contains a talon with q edges and that 6((R)) =n —q. 

Let vu be the center of the talon, and denote the tree obtained from T,, by 

deleting the q edges of the talon by 7,_,. Let w be a vertex of maximal degree in 
(R), and S the vertices adjacent in (B) to w. Clearly 7,_, can be embedded in 
(R), since 6({(R))2=n—q, but our objective is to do this embedding in such a 
way that the end-vertices of the talon can also be embedded. To achieve this, we 
would like to use as many vertices of S$ as possible when we embed T,,_,. 

Define the embedding t of 7,,_, as follows: 

(1) Root the tree 7,_, at v and set t(v) = w. 

(2) For ue V(T,_,), suppose that t(u) has been defined, and u,, u2,..., Um 
are the children of u. Select the images t(u,),..., t(u,,) such that the 
edges t(u)t(u,),..., t(u)t(u,,) €(R), and such that a maximum number 
of the vertices of {t(u,),..., t(u,,)} are in S. If all of these vertices are 
not in S, label the vertex u “bad” and place it in the set D. The vertex uv 
will always be considered a ‘‘bad” vertex. 

This defines an embedding t of T,,_, into (R), since 6((R)) =n — q. Let S’ be 

the vertices of § not in the image of t. Three situations can occur. 

(a) If |S’| <¢, then the embedding t can be extended to 7,,, since there will be 
at least q vertices adjacent to w in (R) which are not in t(T,_4). 

(b) If |S’|>2, and the number of “‘bad” vertices |D| =k, then (B) > Ky, 
since all edges between D and S’ are in (B). This cannot occur. 

(c) If |S'|>¢, and the number of “‘bad” vertices |D| <k, then many edges of 
T,,-q Will be incident to these “‘bad” vertices. In fact, each edge of 7,,_, is 
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either embedded in S, or is incident to a “bad” vertex. Therefore, at least 
(n — |S — S'|) vertices of T,,-, are adjacent to the “bad” vertices. When |5| 
is small in comparison to n, this will be used to generate vertices of large 
degree in 7,. In fact, 7,., must contain a vertex of degree at least 
(n — |S —S'|)/|D|. This will give a contradiction under appropriate condi- 
tions that will exist when the algorithm is applied. 


Proof of Theorem 1. The proof will be by induction on n, the order of the tree. 
An appropriate choice of c insures that the result is true for small values of n. 
Assume the theorem is true for all trees of order less than n, and that n is large. 
Let M(j)= |e: (fa(n)y'“~ | and N=n+M(j), and assume that G= Ky is 
two-colored such that (R) # 7, and (B) # B. We will show that this leads to a 
contradiction. 

The remainder of the proof will be broken into three cases: 

(1) 7, has a suspended path with at least /(k +1) +1 vertices 

(2) T, has kl? independent end-edges, or 

(3) T, has a talon with at least n/(2kI°(k + 1)) edges. 

These cases are exhaustive. If (1) does not occur, then 7, has a least 
n/(2I(k +1)) end-edges by Lemma 7. If (2) does not occur, then all these 
end-edges are involved in at most ki? talons, which gives (3). 


Case (1). T,, has a suspended path with at least /(k +/) +1 vertices 


Let 7,-, denote the tree obtained from 7, by decreasing the length of the 
suspended path by 1. By the induction assumption, (R) > 7,_,. An appropriate 
choice of the constant c insures that Lemma 5 applies, which gives a contradiction 
in this case. 


Case (2). T,, has kl? independent end edges 


Let m=kl’, and let T,_m be the tree obtained from 7, by deleting m 
independent end-edges. Lemma 6 implies 


r(T,, B)<max{n-—mt+c-:(fp(n—m)y" PO +kP, n+k—-1} 
<n+c(fg(n)y"*~” 


for appropriate choice of c. This contradiction completes the proof of this case. 


Before considering Case (3), we will make some general observations about 
(R) and the degree of vertices in this subgraph. Note that by the definition of 
fa(n), A((R))=N~fa(N). By Lemma 8, fg(N)<2fa(n), so A((R))=N— 
2: fp(n). Also, the number of vertices of ‘“‘small” degree in (R) is small. 
Consider any number p (0 <p <1), and let x be the number of vertices of (R) of 
degree less than (1 — p)n. Each of these vertices has degree at least [pn] in (B) 
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and at least ('4"!) subsets of cardinality k in its neighborhood. Since (B) # B, 


«-(M")<a-n(7). 


This implies that x is bounded by a function that depends only on k, /, and p, and 
not on n. These x vertices can be deleted without significantly changing either the 
number or degree of the remaining vertices (appropriately alter the constants c 
and p). Thus throughout the remainder of the proof we will assume that 
6((R))2=(1—p)n. The appropriate choice for the value of p will depend on the 
conditions in Case (3), which follows. 


Case (3). T, has a talon with at least n/(2kI°(k + 1)) edges. 


Select p (0<p <1) such that pn is the maximal degree of a talon in 7,. Thus 
certainly pn = n/(2kI?(k + 1)). Let u be the center of this talon, and 7,_, the tree 
obtained from T,, by deleting the g edges of the talon, where q = pn. Also, let w 
be a vertex of maximal degree in (R), and S the vertices adjacent to w in (B). 
Since A((R))=N—2:°fg(n), S has at most 2: fg(n) vertices. We apply the 
algorithm described earlier (with t= M(j)). Notation used in the description of 
the algorithm will be used in the following discussion. 

Three subcases j =k —1, j=0, and 1<j<k —1 will be considered. 


jok-1 


If c=2, the algorithm yields an embedding, since |S], and hence |S’|, is less 
than ¢ and (a) of the algorithm applies. This gives a contradiction. 


j=0 


In this case we can assume that neither (a) or (b) of the algorithm applies for 
otherwise we would have a contradiction. Therefore, there are at most k — 1 
“bad” vertices, and one of these vertices has degree at least (n — |S|)/(kK — 1) by 
(c), which contradicts the condition on A(T,) for d > 2. 


1<j<k-1 


Both (a) and (b) of the algorithm give a contradiction, so we assume (c) 
applies. Therefore, the set of ‘bad’ vertices D has at most k — 1 vertices, the sum 
of the degrees of these vertices is at least n—|S|, and S’ has at least M(j/) 
vertices. 

Consider the k — j of these vertices which have the largest degrees. The claim is 
that each of these vertices must have degree at least |S]. If not, then the sum of 
the degrees of the k —j—1 largest degree vertices would be at least n—(j + 
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1) |S|, and some vertex of 7,, would have degree at least (nm — (j + 2) |S|)/(K -j - 
1), a contradiction for d =/ +2. 

Let {v, U,, U2, ..., U;-~-1} be any set of kK —j vertices of T,,., which includes v 
and such that each has degree at least |S|. Let T’ be the subtree of 7, spanned 
by these vertices. Since the length of suspended paths and the number of 
independent end-edges is bounded by some function of k and /, the order of the 
tree T' is also bounded by a function depending only upon & and J. For any 
embedding t of T’ into (R) which voids S and with t(v) = w, there is ak —j—-1 
set Y= {t(v,), t(v2),..., T(U;-4-1)} Of vertices in V(G) — S. 

If the union of the neighborhoods in (R) of the k —j —1 vertices Y contain all 
of the vertices of S except for possibly M(j), then the embedding t can be 
extended to 7,,_, using all but possibly M(j) of the vertices of S. Thus clearly, t 
can be extended to 7,, a contradiction. Thus, we assume that there are at least 
M(/) vertices of S adjacent in (B) to each vertex of Y. 

Since 6({R))=(1—p)n, there are many embeddings t of T’ into (R) avoiding 
S and with t(v) =w. In fact, the number of different k —j — 1 subsets Y yielded 
by such embeddings is b - n*~’~' for some positive constant b. Each vertex of 
each of these subsets Y is adjacent in (B) to at least M(j) vertices of S. 

Consider the bipartite graph L with the vertices in the first part being the 
(k —j — 1)-subsets of V(G)—S, and the vertices in the second part being the 
k-subsets of S. If all of the edges between the (k — j — 1)-subset and the k-subset 
are in (B), then the corresponding vertices in L are adjacent. If some vertex in 
the second part of L has degree at least (,/51,)+1, then (B) > K,,;. Since this 
cannot occur, we have the following inequality (the left hand side is a lower 
bound on the number of edges emanating from the first part, and the right hand is 
an upper bound on the number of edges emanating from the second part) 


va ((e5 EME 


Using the fact that fg(n) <c"n“~!”* for some constant c”, this implies that 


Cis ((, ae 1) )2 


If c is sufficiently large, this yields a final contradiction, which completes the proof 
of this case and the theorem. 


The same techniques used in the proof of Theorem 1 apply to special cases of 
bipartite graphs, in particular for even cycles. 


Proof of Theorem 4. The initial observations, the nature of the induction, and 
the proof of the first two cases are identical to the proof of Theorem 1 with C., 
considered as a B,., bipartite graph (i.e. /=k). Therefore we will use precisely 
the same notation used in Theorem 1 with /=k, and assume we are at the point 
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of beginning Case (3). Thus 7, has a talon with at least n/(4k°) edges. Recall that 
fa(n)<c’ -n™ for B= Cy [2]. 

Select p (0< p <1) such that pn is the maximal degree of a talon in 7,. Thus 
certainly pn =n/(4k°). Let v be the center of this talon, and 7,_,, the tree 
obtained from 7, by deleting the q edges of the talon, where g = pn. Also, let w 
be a vertex of maximal degree in (R), and S the vertices adjacent to w in (B). 
Since A((R)) 2 N —2>°f,(n), S has at most 2 - fg(n) vertices. 

Let T’ be the tree obtained from T,,_, by deleting all of the vertices of degree 
1. Since the length of suspended paths and the number of independent end-edges 
is bounded by some function of k and J, the order of the tree T’ is also bounded 
by a function depending only upon k and J. Hence, there is an embedding t of T’ 
into (R) with t(v)=w and t(7’) disjoint from S. In fact, there is such an 
embedding which avoids not only S but any c"n vertices not in § as long as, for 
example, cn < (1 —p)n/2. 

If the embedding t can be extended to 7,,_, using all of the vertices of S except 
for possibly c, then it can clearly be extended to 7,,. Thus, we assume that the 
embedding cannot be so extended, so there is a vertex not in S which is adjacent 
in (B) to at least c vertices of S. This can be repeated c”n times to obtain a set A 
of c"n vertices, each of which is adjacent in (B) to at least c vertices of S. 

Consider the bipartite subgraph L of (B) induced by the parts A and S. In L, 
each vertex of A has degree at least c relative to §, c<|S$|<c’- nvk and 
|A| =c"n. Therefore by a result in {11], there is a path in L of length 2k — 2 with 
both end-vertices in S. This path with w, which is adjacent in (B) to each vertex 
of S, generates a C,,. This contradiction completes the proof of the theorem. 


4. Problems and comments 


Two critical graphical parameters in the determination of the Ramsey number 
r(S, G), when S is a large order sparse graph (or in particular a tree), are the 
order of S and the chromatic number 7(G) of G. Also, the Ramsey number 
r(S, B) where B is a bipartite graph induced by two color classes in a x(G) 
coloring of the vertices of G, appears to be an important factor in determining 
r(S, G) {6, 8]. This is one of the motivations for working on the problems 
considered in this manuscript. 

There are several places where the results presented could be improved; 
however, one is of particular interest. If 7, is a tree with only “‘small” degree 
vertices, then 


r(T,, Be =nte 


for a sufficiently large c. It would be nice to show that c =k — 1 is sufficient in 
general. For special classes of graphs this has been verified in [4] and [9]. 
There are several papers dealing with the Ramsey number of a fixed graph and 
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a sparse graph [3—5, 8]. It would be of interest to know which sparse graphs could 
replace the trees of Theorem 1 and Theorem 5 without altering the results. 
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1. 


Graffiti is a computer program which makes graph-theoretical conjectures. It 
was written in 84-85 and was preceded by a similar but less complex program, 
Little Paul, on which the author collaborated with Shui-Tain Chen. 

The basic idea of Graffiti is that it “knows” certain graphs and it is capable of 
evaluating certain formulas formed from graph-theoretical invariants. If none of 
the graphs with which Graffiti is familiar is a counterexample to a formula then 
the formula is considered to be a conjecture. At present Graffiti is capable of 
computing about 60 invariants and it performs several functions but I shall 
describe here only those which are relevant to conjectures described in the next 
section. The conjectures are of the form I1<J, Je J + K andI+J¢K+L, where 
literals run over 20 distinct selected invariants plus a constant invariant 1. They 
are respectively called conjectures of the first, second and the third type. 

After the first run of the program the total number of conjectures was about 
7500. The Library of the program had at the time about 40 graphs. 

The number of conjectures, particularly those which are completely trivial, is 
the main problem and more than half of the program consists of various heuristics 
whose purpose is deletion of trivial and otherwise noninteresting but true 
conjectures. The conjectures in the next section were obtained by procedures 
IRIN and CNCL. IRIN deletes those conjectures which by transitivity follow 
from others. 

All conclusions that Graffiti draws are based exclusively on graphs it knows, so 
“follows” means of course “follows as far as the graphs in the library of program 
are concerned”. During the first run IRIN reduced the total number of 
conjectures to about 1700. 

CNCL deletes those conjectures of the second and third type in which one of 
the invariants on the left is always smaller than an invariant on the right. This 
procedure may undoubtedly remove a number of interesting conjectures but it is 
the only one which seems to operate completely independently from IRIN. 

During the first run CNCL following IRIN reduced the list of conjectures to 
about 140. For other sets of 20 invariants the number was usually at least 100 
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higher and that is one of the main reasons why I decided on this particular set of 
invariants. 

At this point I divided conjectures in four categories: False, Open, Trivial and 
Proved. The last two types were stored in separate files and after repeated runs 
they would be automatically removed from the new lists of conjectures. After this 
the last three files could be updated. 

For each false conjecture I had to inform Graffiti about a counterexample; 
Graffiti is capable of defining certain types of graphs and of performing some 
operations on graphs leading to more complex examples. 

After the first run there were two conjectures which were already proved 
before. I disproved about 15 using five counterexamples and I classified about 40 
conjectures as trivial. 

During the following month I added to the Library of Graffiti about 40 graphs 
but they were counterexamples to about 80-100 conjectures which appeared in 
the file Open; it was not unusual for this file to grow in size after a few 
counterexamples were added to the library. The total number of conjectures was 
of course steadily going down, but if IRIN had rejected a conjecture because it 
had “followed” from a false conjecture then the rejected one could have 
reappeared after the false one has been disproved. 

At the time of writing of this paper there are 6745 conjectures and the file 
Open contains 35 conjectures. The conjectures of the third type were not yet 
really studied by anyone and I decided not to include them here. There are 12 
conjectures which were proved and the file Trivial contains about 80 conjectures. 

The conjecture number 14 was almost completely proved by William Waller. 

Graffiti has procedures computing chromatic number, independence and size of 
the matching but they are so slow for larger graphs (some of the counterexamples 
have up to 80 vertices) that I choose rather to inform the program about their 
values. That was particularly essential when I experimented with various graphs 
and Graffiti was verifying if they were counterexamples to a conjecture. 


We are going to use the following notation: If R=(n,1m,...,7,) is a 
real-valued vector then the value of the component which occurs most often is 
called the mode of R and the maximal frequency of R is the number of occurrences 
of the mode. In the case of a tie the mode is the largest component. 

The Degree denotes the degree sequence of a graph. Hence, for example 
maximal frequency of Degree of a path with 3 vertices is 2 and the mode is 1; the 
maximal frequency of distance is the distance which occurs the maximum number 
of times. The temperature of a vertex is d/n — d where d is the degree of the 
vertex and n is the number of vertices. Vectors can be considered random 
variables on a uniform space; this gives rise to some invariants defined by means 
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of the average value and the variance. However, the average distance is the mean 
of the vector is the expected distance between distinct vertices. 

A vertex c of a graph is central if every other vertex of G can be reached from c 
in a minimum number of steps. The number of central vertices is the center of G 
and the number of steps is the radius. The maximum distance between any two 
vertices is the diameter and the boundary consists of those vertices whose distance 
from some vertex equals to the diameter. 

The inverse degree is } 1/d; where d; is the degree sequence. The Randic 
Index, or Randic, is ¥ (d;d,)~} where the summation extends over pairs of 
adjacent vertices. The last two invariants are not defined in the presence of 
isolated vertices and of course metric invariants apply only to connected graphs. 

Independence and matching denote respectively the size of the largest inde- 
pendent set and the size of the largest matching. 

Finally the rank is the rank of the adjacency matrix, and the zenith is the 
number of vertices of maximum degree. 

The following conjectures are now in the file Open: 


average temperature < rank 

average temperature < variance of degree + maximal frequency of degree 
inverse degree < Randic index + maximal frequency of degree 

mode of distance < radius + Randic index 

mode of distance < average distance + Randic index 

mode of distance < matching + average distance 

radius < zenith + maximal frequency of degree 

radius < variance of degree + maximal frequency of degree 

radius < Randic index + average temperature 

radius < Randic index + variance of degree 

. radius < average distance + Randic index 

. radius < 1 + Randic index 

. average distance < Randic index 

. average distance < independence 

. average distance < variance of degree + maximal frequency of degree 
. average distance < variance of degree + inverse degree 

. average distance < mode of distance + inverse degree 

. chromatic number < 1 + rank 


CRN AMWEYNS 


etl arti willow oo 


Below are listed conjecture which were proved so far. The 6th conjecture was 
proved by Shui-Tai Chen and the 7th by William Waller. The 10th was proved by 
Chen and myself. Sometimes they resulted in stronger theorems, but they are 
stated here in the form in which they were conjectured. 


1. radius < independence 
2. average temperature < chromatic number 
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3. diameter < rank 

4. radius = matching 

5. average distance < 1 + matching 

6. inverse degree < independence + Randic index 
7. inverse degree < matching + independence 

8. Randic index < matching + independence 

9. Randic index < number of vertices 

10. average degree < matching + Randic index 

11. Randic index < matching + inverse degree 

12. matching < Randic index + inverse degree 


None of these theorems are very difficult, but almost every one, I think, is of 
some interest. 

For example, the second theorem on the list is one of the easiest but it 
provides, what I consider a neat lower bound for the chromatic number y. It can 
be proved as follows: Let C be a good y-coloration of G and let G be a complete 
%-partite graph compatible with this coloration. Because the average temperature 
of G is greater than that of G it is enough to prove the theorem for G. But the 
average temperature of G is x — 1. 

Hence we proved that the average temperature is <y—1 and the equality 
holds true iff G is complete k-partite graph. 

This conjecture realized my hope that Graffiti will produce interesting 
conjectures which are easy to prove. But beforehand I did not see too many 
justifications for this hope. The only reason I could think of was that Graffiti 
might find overlooked facts and those should be independent from both their 
difficulty and the significance. 

On the other hand I expected Graffiti to find conjectures difficult to prove. I 
thought that the program would operate like a statistician who finds a correlation 
without a cause. Perhaps some conjectures which Graffiti has already made are 
difficult to prove but in most of the cases one can clearly see a reason for making 
the conjecture. 

A few of the counterexamples were found by Chen and Waller and a few are 
based on an idea of Erdés and Spencer dealing with the disproved conjecture 
average distance < inverse degree. 

The graphs represented in the Fig. 1 are fairly typical in that each has an 
extreme value for one of the invariants. This is the case with the mode of distance 
of Crab, the average distance of Barbells and Binaries and the Randic Index of 
Milky Way. Finally, in Milkweeds the radius is almost as large as the diameter. 

Some of these graphs are fairly large but the reader may notice that for all it is 
relatively easy to write the defining programs. Thus so far I have not had to resort 
to elaborate lising of edges to define a graph. But a close call came when Chen 
found an 8-vertex counterexample to the conjecture radius < maximal frequency 
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Binary 


Milkweed 


Barbell 


Milky Way 


Crab 


Fig. 1. 


of degree. Eventually I wrote a procedure which automatically searches for 
counterexamples to conjectures of this type by optimizing the polarity. The 
procedure has since found several other useful graphs. 

In the future I will probably use more such procedures. There are many ways 
of doing this and I may even already have a prototype for one. In 1982 Noriko 
Naumann wrote as a part of her master’s thesis a short but scintillating program 
called Pythia which guesses the next number which should appear in a sequence 
of integers. The program guesses patterns by simplfying them in a manner similar 
to the one described by Dewdney in the April 1986 issue of Scientific American. 


118 S. Fajtlowicz 


It can be adapted to graph theory and it has both conjecture-making and 
example-finding potential. 

I would like to thank Robert Cottingham for writing for me a fast program 
deleting the text of one file from another. 


Note added in Proof. Conjectures 14 and 1 were proved respectively by Fan 
Chung and myself. Conjecture 7 was refuted by Shui-Tan Chen. 
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With but a few exceptions, the Ramsey number r(G, 7) is determined for all connected 
graphs G with at most five vertices and all trees T. 


1. Introduction 


For graphs G and H define the Ramsey number r(G, H) as the least number N 
such that in each two-coloring (R, B) = (red, blue) of the edges of Ky there is a 
red copy of G or a blue copy of H. Sometimes we start with a graph L on N 
vertices and refer to L as the red graph and to its complement L as the blue 
graph. 

We investigate r(G, H) when G is any connected graph with at most five 
vertices and H is an arbitrary tree. All but a few of the numbers are obtained. In 
the process of the investigation close attention is given to the graphical 
parameters of G and H which affect the Ramsey number. 

Similar Ramsey numbers have been considered in papers [1-—4, 7-9], when G is 
a fixed graph and H is a large sparse graph (one with many vertices and few 
edges). Of course a large order tree is one such sparse graph. Complications arise 
in finding this Ramsey number when H is of arbitrary order or has large maximal 
degree, even when H is a tree and G is of small order. 

To avoid lengthy case analysis type arguments most of the Ramsey numbers 
discussed are given without proof. Instead emphasis is given to the general 
strategy of the proofs and how the results relate to known Ramsey numbers. 


2. Terminology, notation, and related results 


For the most part the terminology and notation used conforms with the usual 
and accepted. Specialized terminology is summarized in what follows. 

For a graph G, p(G) denotes the number of vertices of G and s(G) (called the 
chromatic surplus) is the minimum number of vertices in a color class under all 
x(G) — vertex colorings of G. If H is a subgraph of G, then G — H denotes the 

* Research supported by NSF under grant DMS-8603717 
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graph obtained by deleting the edges of H. For disjoint graphs G and H, the join 
G +H is the graph obtained by adding the missing edges between vertices of G 
and vertices of H. The symbol tG denotes ¢ disjoint copies of the graph G. 

In order to conveniently describe trees which contain large stars as subgraphs 
we develop a special notation. Let F be a forest with each tree in the forest 
rooted, say with roots v,,v2,...,u,, and K(i,s,) a star with vertices 
Up, Uy,..-,U, and uy the center of the star. If s=vr, a new tree is found by 
identifying u; and uv; for 1<i<r. This tree is denoted by 7,(F) where n is the 
total number of vertices in the tree. In the forest F unmarked stars (paths) are 
assumed rooted at their centers while ones marked with an asterisk are rooted at 
a vertex of degree one. For example, for the forests F,=K(1,2)UP3 and 
F,=(K(1, 3))*U Py, Tio(Fi) and Tyo(F,) are shown in Fig. 1. Throughout the 
paper the symbol 7, is used exclusively to denote an arbitrary tree on n vertices. 

Let G and H be graphs with H connected such that p(H)2=s(G). Then 
r(G, H) = (x(G) — 1)(p(A) — 1) + s(G) in view of the example in which the red 
graph is the complete multipartite graph with ¥(G) — 1 parts of size p(H) — 1 and 
one of size s(G) — 1. If r(G, H) = (x(G) — 1)(p(H) — 1) +5(G) then H is said to 
be G-good. One of the earliest results in generalized Ramsey theory was that of 
Chvatal [6], in which he stated that r(K,,, T,)=(m—1)(n—1)+1 for all 
m,n=>1, i.e. all trees are K,,-good. Thus for graphs of this paper, G of small 
order and H an arbitrary tree, the chromatic number of G and the order of H are 
parameters which affect r(G, H). 

It is known that 

(1) a connected sparse graph H of sufficiently large order and appropriate 

maximal degree is G-good for G a fixed graph [7], 


Ty9 (Fy) Ti 9 (F) 


Fig. 1. 
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(2) a tree T, is K(1, 1, m,, mz, ..., m,)-good for m,, m2,..., m, fixed and n 
large [3], and 
(3) for large n the star K(1,7n) is neither K(1, m,, m2,...,m,)-good for 


m, = my_, 2+ +: 2m,2=2 (each m; fixed), nor K(2, 2) good [4, 9]. 
In fact it is shown in [9] that r(K(2, 2), K(1,n))>n + [n}—6nis| for n large, and 
that r(K(2, 2), T,) = max{4, 2 + 1, r(K(2, 2), K(1, A(T,))} for all n. We shall see 
that the only troublesome small graphs G—those where r(G, T,) is not always 
determined—are those which contain K(2, 2) as an induced subgraph. 
We record (for later reference) in Theorem A two goodness results. Clearly 
both are generalizations of Chvatal’s theorem. 


Theorem A. (1) Jf l,m21, n2=2 and |>(m—1)- |(m—1)/(n-1)|(n -1), 
then | 
r(K;+ Kn,» Tn) = (1+ esac -1)+1. [10] 


(2) If m=6, n2=3 and T,#Kin-1(n =4), then r(K,, —¢K2, T,)=(m—-t- 
1)(n — 1) +1 for all t, O<¢ < [(m —2)/2]. [11] 


We wish to see which other graphical parameters of G and T,, affect r(G, T,). 
First let ¢; and ¢, be positive integers and set L = ¢,K,(g)-1 U®K,(G)-2- Choose t 
and ¢, such that 7, is not a subgraph of L and such that p(L) is maximal. Setting 
t(G, T,,) = p(L) it is clear that r(G, T,,) = ¢(G, T,,) + 1. 

As innocent as this bound appears it frequently is the value of r(G, T,,). We 
cite two such cases which are recorded formally in Theorem B below. To do this 
we need additional notation. Let a(G) denote the independence number of G. 
For the tree 7,, let a'(T7,,) = min{a(F) | F is the forest obtained by deleting from 
T, a vertex and its neighbors}. Thus a’(7,,) is a measure of how small the 
independence number of a non-neighborhood of a vertex of the tree 7, can be. 
One can for example show that if 2a’+3<k, then n+k—-—2-—a'(T,)-—d= 
t(P,, T,) +1 where 6 =0 if k —1 divides n+k—3-a'(T,) and 6 =1 otherwise. 
In fact a graph L which works in this case can be obtained as follows: let 
n+k—3-a'(T,)—6=a(k-—1)+6 where 0<b<k-—1 and set 1,(R, T,)= 
a—k+2+5 and ¢,(P,, T,)=k—1— 6. It can be checked that the resulting L 
satisfies the requisite condition, (see [1]). 


Theorem B. (1) Let H be a connected graph with n vertices and no more than 
n(1 + 1/81k°) edges. Then for k =2 and n> 352k” 


r(P,, H) = max{n + [4k] —1, n+ k-—2-a'(H)— 6} 


where 5 =0 if k— 1 divides n—k —3- a'(H) and 6 =1 otherwise. [1] 

(2) Let k be an integer =2, and n2=2(3k —2)(2k —3)(k—2)+1. Then 
max{n,n +k —1-—a'(T,) — 6} <r(K(1, 4), T,) S max{n, n + k -—1-'(T,)} 
where 6 =0 ifn —k —2-—'(T,) is divisible by k and 5 = 1 otherwise. [8] 
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The remaining relevant graphical parameter of G and T7,(p(G) <5) which 
affects the value r(G, T,,) is the maximal degree A of both G and 7,,. It is well 
known (see [5]) that r(K(1, 5,), K(1, s2)) =5,+52— €(s,, 52), where 


1 when both s, and s, are even 
0 otherwise. 


€(5,, 52) = | 


One expects r(G, 7,,)=r(A(G), A(T7,)) for certain special graphs G and special 
trees T,,. 

Having explored the important grapical parameters affecting the Ramsey 
number we are prepared to define the function f which (we shall see) does for the 
most part satisfy r(G, 7,,)=f(G, T,) when G is of small order. Let G and H be 
graphs, H connected, and define 


f(G, H) = max{(x(G) — 1)(p(H) — 1) 
+5(G), (G, H) +1, A(G) + A(H) — e(A(G), A(H))} 


From the above discussion it is clear that f(G, T,,) is a lower bound for r(G, T,) 
and that there are pairs of graphs where each of the values maximized in f 
dominants the remaining ones. 


3. Main results 


With the important groundwork covered we are in the position to state the 
results. Before doing so a comment or two should be made about the value of 
«(G, H). For arbitrary graphs this value is difficult, if not impossible, to compute. 
In the case where it is needed in the theorems below such is not the case. Surely 
when 7(G) =3 and H is a tree 7,, except possibly for n =1 or 2, f(G, T,)> 
(G, T,) + 1. Thus the reader will only need to be concerned with ¢(G, T,,) when 
G is bipartite. More importantly it can be shown that in all cases considered in 
Theorem 1 (below) ¢(G, T,,) only affects the value of f(G, 7) when A(T,) = 
n—5. Using this makes f(G, T,,) easy to calculate. 


Theorem 1. Let G be a connected graph with at most five vertices and let T,, be a 
tree on n vertices, n 29. Then with but a few exceptions (those listed in (i)—(iv) 
below) we have r(G, T)=f(G, T,). 
(i) r(K5— 2K,, K(1,n —1))=2n+1 when n is even. 

(ii) r(Ks;— Py, K(1,n—1))=2n when n is even 

(iii) r(K(2, 2), T,) =r(K(2, 3) —e, T,,) = max{n + 1, r(K(2, 2), A(T,))} 

(iv) There exists a constant c such that n+1<r(K(2, 3), T,)<max{n +c, 

r(K(2, 3), A(T,))}. 


The lack of exactness in items (iii) and (iv) of the theorem stems primarily from 
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not knowing r(G, K(1,n)) exactly when G is bipartite. It is known that 
n+ |n}-6nt] <r(K(2, 2), K(1,2))<n+ [n] +1 for n large (7, 9]. 


Theorem 2. /f T,, is a tree with n <8 and G a connected graph with p(G) <5, then 
r(G, T,,) = max{f(G, T,), f(T,, G)}, except for the pairs (G, T,) given in Tables 1 
and 2. 


Since there are thirty-one connected graphs with at most five vertices and 
forty-eight trees with at most eight vertices the number of exceptions given in the 
tables is small. We will not prove Theorem 2 here, since the argument is tedious 
and lengthy. Also only part of the proof of Theorem 1 will be given for the same 
reason. 


Partial Proof of Theorem 1. We only concern ourselves with the proof when 
z(G) = 3. The reason is twofold. First the proof for those cases when G is a tree 
and n is small has many special subcases and is similar in nature to the proof of 
Theorem B. Thus little is gained by presenting this part of the proof. The only 
remaining possibility is when x(G) = 2 and K(2, 2) is a subgraph of G. The proof 
of (iii) and (iv) is essentially in [9], so neither is repeated here. 

Therefore assume x(G)>2 so that f(G, 7,)=(x(G)-—1) (n-—1)+1. When 
G=Ks, K,, or Ks—e the value of r(G, 7,) is given in Theorem A(1). Also 
clearly r(K,, T,)<r(Ks— Ps, T,), r(Ks—K(\, 3), T,) <r(Ks—e, T,) and by 
Theorem A(1) r(K,, T,,) =r(Ks—e, T,) =3n —2, n =3. Thus the theorem holds 
when x(G) = 4. 

The only remaining possibility, when x(G)=3, is handled as follows. First 
r(K3, T,) =r(K,—e, T,) by Theorem A(1) and r(K,—e, T,)2r(K3+e, T,)= 
r(K3, T,,). Each remaining 3-chromatic graph G is a subgraph of one of K;— 2K, 
K,— P,, and Ks — K3. But r(G, T,,) = 2n — 1 for each such G and we show in the 


Table 1. 


T,3K,) 


T(K(1, 2)) 


Soa ee a 


T,(K(1, 3)) 
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Table 2. 
K(2, 3) 
K(, 3) 7 
T3(K2) 7 
K(, 4) 8 
T(P3) 8 
T.(K2) 8 
To(K(1, 2)) 8 
K(, 5) 10 
T(K2) 10 
K(1, 6) i 
T,(P3) 10 
Ta(2k2) 10 
T,(K) 1 
K(, 7) 13 
T(K(1, 3)*) 10 
Ta(K(1, 2)) 10 
7,(K(1, 3)) 10 


lemma given below (for n=6) that r(K;— 2K», T,)=r(Ks— Py, T,) =r(Ks— 
K;, T,)=2n—1 when T,#K(1,n—1) for n even. Thus upon proof of the 
lemma the 3-chromatic case is proved except when 7, = K(1, 2 — 1) with n even. 
It is easy to check for x¥(G) =3 and n even that r(G, K(1, n — 1)) =2n — 1 unless 
G = K,;— P, or K;— 2K; and these are considered in the lemma. 


Lemma. For n = 6 each of the following hold. 
(i) r(Ks— K3, T,) = 2n -1. 


2n+1 = when T,, = K(1,n—1), n even 
2n—1 otherwise. 
(iii) r(Ks — Py, K(1, 2 — 1)) = 2n for n even. 


(i) r(Ks~2K2, T,)={ 


Proof (i) Surely r(K;— K3, T,,) = 2n — 1 so let each edge of K2,_, be colored red 
or blue. We suppose there is such a coloring where there is no red Ks — K3 and 
no blue T,, and show this leads to a contradiction. 

First let 7,,=K(1,n—1). Since the two colored K2,-, contains no blue 
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K(1, n —1), each vertex has red degree at least n. But r(K3, T,,) = 2n — 1 so that 
the red graph contains a K; whose vertices we label v,, v2, v3. If for i#j, uv; and 
u; have two common red adjacencies off the red triangle, then the red graph 
contains a K;— K3. Since this cannot happen, there are at least 3+ (nm — 2)+ 
(n — 3) + (n — 4) = 3n — 6 vertices in K2,,_,;. Hence n <5, a contradiction. 


Next assume 7, #K(1,2—1). Since Theorem 1 is assumed true when 
x(G) =2, r(K(, 3)T7,) <n +1. Thus if the red graph in the two colored K2,_, 
has a vertex of degree n + 1, its neighborhood contains a red K(1, 3) implying the 
existence of a red K;— K3. Hence each vertex of the K2,_, has blue degree at 
least n — 2, and since 7, #K(1,n—1), the blue graph either contains a 7, or a 
K,,_,. Since the former is impossible, the existence of a blue K,,_, and no blue T,, 
implies the red graph contains a K(n — 1, n). This forces either a red Ks — K3 ora 
blue 7,,, a final contradiction. 


(ii) We first consider the case when 7, = K(1,n—1) with n even. To see 
r(K;—2K,, K(1,n—1))>2n let L (the red graph) be the graph formed by 
placing a perfect matching in each part of a complete bipartite graph K(n, n). 
Clearly this graph contains no K;—2K, and the blue graph L contains no 
K(1, n — 1). Thus color each edge of K2,,, red or blue and suppose the coloring 
is such that there is no red Ks — 2K, and no blue K(1, 2 —1). We consider two 
possibilities, the first of which is that the red graph contains a K,. Let S denote 
the set of those vertices of K2,,, not part of the red K,. No red Ks — 2K> implies 
each vertex of S is adjacent in red to at most two vertices of the K,. Thus the 
total number of blue edges incident to the four vertices of the K, is at least 
2(2n — 3) implying that one of these four vertices must have blue degree at least 
n—1, a contradiction. 

Since by Theorem A r(K,— K>, T,,) =2n —1, the two colored K2,,, contains a 
red K,— K, and as just argued no red K,. But by repeating the above argument 
where the red K,— K, replaces the red K,, and S is the set of vertices of K2,,,, 
not part of the red K,—K2, we again obtain a blue K(1,n-—1). This 
contradiction completes this part of the roof. 


Next consider the remaining possibility, when 7,, is a tree that is not a star with 
an odd number of edges. We then show r(K; — 2K2, T,) = 2n —1 for n =3. Since 
2n—1 is clearly a lower bound, we need only show r(Ks5— 2K), T,) <2n—1, 
which is done by induction. It is easy to check that this holds for n =3 and for 
n=4 when 7,#K(1, 3). Thus assume the result for values less than n and 
consider a K,,, with each edge colored red or blue. We consider two cases. 


Case 1. The red graph contains a K, 


Delete from 7, two end vertices x and y such that the resulting tree 7,-,= 
T, -x—y is not a star when 7, #K(1,n—1) and n26. By induction the two 
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colored K2,-5 obtained from the two colored K2,-, by deleting the red K, 
contains the blue 7,2. let « and v denote the neighbors of x and y in 7,. Now u 
and v are blue adjacent to distinct vertices of the red K, (when u=v this means 
this vertex has two blue adjacencies to the red K,) or one of them has at least 
three red adjacencies to the red K,. Hence there either is a red K; —2K, or a 
blue 7,, in the two colored K2,-;. 


Case 2. Since r(K,— K2, T,) =2n—1, the red graph contains a red K,— K2 
and no red K, 


The argument for this parallels the one given in Case 1, simply repeat the 
argument given with the red K,— K2 replacing the red K,. 


(iii) To see r(K;—P,, K(1,n—1))>2n-—1 let L (the red graph) be the one 
obtained from the complete bipartite graph K(n, n— 1) by inserting a perfect 
matching in the part with n vertices. Clearly this graph contains no K;— P, and 
the blue graph L contains no K(1, 7 — 1). 

Two color the edges of a K2, and suppose this graph contains no red Ks— P, 
and no blue K(1, n— 1). For n=4 it is easy to verify this is impossible. Hence 
assume n is even, n > 4, and proceed by induction or n. Just as noted earlier the 
red graph contains a K,— K2. Let S be the remaining 2n — 4 vertices of the K2, 
(other than those four of the K,— K2). By the induction assumption some vertex 
of S has blue degree at least n — 3 relative to the graph induced by the set S. Also 
each vertex of S must be blue adjacent to at least two vertices of the red K,— Kz 
to avoid a red K;—P,. This makes some vertex of S have blue degree at least 
n—1, acontradiction. This completes the proof of the lemma. OU 


References 


{t] S.A. Burr, P. Erdés, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, Ramsey numbers for the 
pair sparse graph, path, or cycle, Trans. of A.M.S., 269 (2) (1982) 501-511. 

[2] S.A. Burr, P. Erdés, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, The Ramsey Number for 
the pair complete bipartite graph—graph of limited degree, Graph Theory With Applications to 
Algorithms and Computer Science (Wiley, New York, 1984) 163-174. 

{3} S.A. Burr, P. Erdés, R.J. Faudree, R.J. Gould, M.S. Jacobson, C.C. Rousseau, and R.H. 
Schelp, Goodness of trees for generalized books, Graphs and Combinatorics 3(1) (1987) 1-6. 

[4] S.A. Burr, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, Ramsey numbers for the star-like 
multipartite graphs, JGT 7 (1983) 395-409. 

[5] S.A. Burr and J.A. Roberts, On Ramsey numbers for stars, Utilitas Math. 4 (1973) 217-220. 

[6} V. Chvdtal, Tree-complete Ramsey number, JGT 1 (1977) 89-91. 

{7} P. Erdos, R. J. Faudree, C.C. Rousseau, and R.H. Schelp, Multipartite graph-sparse graph 
Ramsey number, Combinatorica 5(4) (1985) 311-318. 

[8] P. Erdés, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, Graphs with certain families of 
spanning trees, JCTB 32 (2) (1983) 162-170. 


Graph-tree Ramsey numbers ; 127 


[9] P. Erdés, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, Some complete bipartite graph-tree 
Ramsey numbers, to appear in First Danish Intl. Conf. on Graph Theory. 
{t0] P. Erdds, R.J. Faudree, C.C. Rousseau, and R.H. Schelp, Generalization of a Ramsey result of 
Chvétal, The theory and applications of graphs (Wiley, New York (1980) 351-361. 
{11] R.J. Gould and M.S. Jacobson, On the Ramsay numbers of trees versus graphs with large clique 
number, JGT 7 (1983) 71-78. 


This Page Intentionally Left Blank 


Discrete Mathematics 72 (1988) 129-139 129 
North-Holland 


ENUMERATING PHYLOGENETIC TREES WITH 
MULTIPLE LABELS 


L.R. FOULDS* and R.W. ROBINSON** 
"School of Management Studies, University of Waikato, Hamilton, New Zealand 
** Department of Computer Science, University of Georgia, Athens, GA 30602, U.S.A. 


Received 15 August 1986 


Evolutionary trees are used in biology to illustrate postulated ancestral relationships 
between species and are often called phylogenetic trees. They can be characterized in graph 
theoretic terms by certain classes of labelled trees. Disjoint subsets of the labelling set are 
assigned to tree vertices so that all pendant vertices and any vertices of degree two are labelled. 
Here we determine exact and asymptotic numbers for two classes of trees in which multiple 
vertex labels are allowed. In the first class vertices of degree two are forbidden and in the 
second class vertices of degree greater than two cannot be labelled. A general method is 
presented for deriving the asymptotic analysis of any multiple label case. Asymptotic results for 
the two classes of trees under study are then obtained by applying this method to previously 
published results. This paper completes work by the authors on the enumeration of various 
classes of phylogenetic trees. 


1. Introduction 


Biologists often represent postulated evolutionary relationships between exist- 
ing biological species by means of a tree. Such a diagram, linking related species 
to acommon ancestor, is called a phylogenetic tree or phylogeny. 

In this paper we define a phylogeny to be a tree in the graph theoretic sense 
together with a set {1, 2,...,m}, of labels and a function f, mapping the labels 
into the vertex set of the tree. Every vertex of degree less than three in the tree 
must be in the image of f. Note that some vertices may possess multiple labels 
and some vertices no label at all. The label set corresponds to a given set of 
existing species. The magnitude of a phylogeny is the number n of its labels. The 
order of a phylogeny is the number of vertices in it. A planted phylogeny is a 
phylogeny having a pendant vertex which is distinguished, and is termed its root. 
This root represents the common ancestor of all the species in the labelling set 
and thus is not given one of the labels of this set. The construction and the 
biological significance of phylogenies have been discussed by many authors. See 
Penny et al. [11] for a recent exposition. The graph theoretic notation and 
terminology used in this paper is fully defined in the book by Harary [7]. 

The above mentioned paper by Penny et al. discusses the determination of 
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phylogenies which satisfy the optimality criteria of various scientific models of 
evolution. In the search for phylogenies which are optimal in this sense it is of 
interest to know how many possible feasible phylogenies exist for a given set of n 
labelled species. The exact and asymptotic numbers of phylogenies with given 
magnitude was calculated by the present authors in [3]. The mean and variance of 
the orders of these trees was also determined as well as the effect of requiring that 
each label be a singleton. The authors carried out the same analysis for binary 
phylogenies in [4]. The effect of restricting the labelling to pendant vertices was 
also studied. In [5] this analysis was carried out for phylogenies without vertices 
of degree two and where only pendant vertices are labelled. Once again the effect 
of making all the labels singletons was studied. In [6] the analysis was repeated 
for phylogenies which are labelled only with singletons and where there are no 
vertices of degree two. A second case was also analysed in which there are no 
restrictions on vertex degree, all nonempty labels are still singletons, but only 
vertices of degree one or two can be labelled. The four papers just referenced 
comprise an analysis of 10 classes of phylogenies. There are two remaining cases 
which complete the set of all feasible combinations of restrictions on vertex 
degree, labelling, and label size: 


Case I. There are no vertices of degree two, and interior vertices may possibly be 
labelled. 


Case 2. There are no restrictions on vertex degree, and only vertices of degree 
one or two can be labelled. 


In each of these cases, labels need not be singletons. In the biological context, 
this corresponds to the case where different species cannot be distinguished by the 
comparison criterion being used to construct the phylogeny. 

The objectives and terminology which are common to the two cases are now 
introduced. We denote by 7,, the number of phylogenies of magnitude n. A useful 
device is the exponential generating function: 


T(x) = x T,x"/n! (1.1) 


It is also useful to calculate the mean and variance of the numbers of vertices in 
phylogenies of given magnitude. This is done by determining T,,,, the number of 
phylogenies with magnitude n and order p. The corresponding exponential 
generating function is 


2n-2 


T(x, y)= 2 2 Typ? y?/n!. (1.2) 


1 p= 
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Note that 
T(x) = T(x, 1). 


A proof that 1 <p <2n — 2 is given in [2]. 

The numbers just discussed are first determined for planted phylogenies. We 
denote by P,, the number of planted phylogenies of magnitude n and P,,,, the 
number of these of order p. Further, we define the exponential generating 
functions: 


P(x)= > P,x"/n! (1.3) 
P(x, y)= > > Pr,pX"y"/n!. (1.4) 


We also wish to determine the y,, and the variance o%, of the number of 
vertices among all the phylogenies with magnitude n. We can achieve this by 
using the following recurrence relations for the first and second moments about 
the origin: 


2n-—2 
T= >) plap (1.5) 
p= 
2n-—2 
TQ = x p?Tn p- (1.6) 
c= 
Of course 
By = T/T, (1.7) 
and 
on =(T2/T,) — ui. (1.8) 


We define exponential generating functions for these moments: 
T(x) = Ss Tx" /n!, (1.9) 
n=1 


T(x) = > T2x" Jn! 
n=l 


We note that 

T(x) = T, (x, 1) (1.10) 
and 

T(x) = Tyy(x, 1) + T,(x, 1). (1.11) 


Hence we can evaluate the moment generating functions by using (1.10) and 
(1.11). 
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Let 
Sn = T?) > TY), 


R,=T., 
R(x) = T(x), and 
S(x) = T(x) — R(x). (1.12) 


We can compute values for To, T2, S,, u,, and o7 by using values for P,, T,; 
and (1.3)—(1.12). 


2. Transformation from single to multiple labels 


We note that Case 1 differs from another case analyzed previously by us in 
(6, §2] only in that multiple labels are allowed at each vertex, instead of being 
restricted to being at most one. Case 2 differs from that of [6, §3] in the same 
way. In this section we show how to derive the exact and asymptotic values of the 
numbers of interest in both of these cases from the available results for the 
singleton label cases. In fact, the assumptions required in our analysis are true 
also for four other cases previously studied by the authors ({3 §4, 4§2, 4§4, 
5 §2]). Thus the results to be derived could be used to give an alternative 
treatment of those cases. 

Let P, T, R, and S§ denote the functions for the singleton label case 
corresponding to P, T, R, and S respectively. Whereas the exponential 
generating function for labelling a vertex with a single label is x, it is (e* — 1) for 
multiple label cases. This is because any vertex may receive any positive number 
of labels. For any k =1 there is just one way to assign k labels to a vertex. 
Interleaving of label sets is accounted for in multiplying exponential generating 
functions together; see [8, chapter 1] for an account of the uses of exponential 
generating functions in labelled enumeration. Thus P(x) = P(e*— 1), T(x)= 
T(e* — 1), R(x) = R(e* — 1), and S(x) = S(e* — 1). From the fact that (e* — 1)*/k! 
is the exponential generating function 


Si S(n, k)x"/n! (2.1) 
n=l 
for Stirling numbers of the second kind, it follows that 
P,= >, S(n, k)P, n=. (2.2) 
k=1 


The analogous equations hold for 7, R,, 5, where in these cases n > 2. Stirling 
numbers are readily calculated from the recurrence 


S(n +1, k) =S(n, k—1) + kS(n, k) (2.3) 
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which holds for k 21, and from the boundary conditions: 
s(0, 0) =1, S(0,k)=0 if k>0, S(n, 0)=0 ifn>0. 


We now assume that P(x), T(x), R(x), and S(x) all have the same radius of 
convergence p, 0< <1, and that for each of these series the sole singularity on 
the circle of convergence |x| = fp is the positive real point x = 6 when x is viewed 
as a complex variable. We also assume that in some neighborhood of x = p, an 
expansion of the form 


P(x) = ap — a,(p — x)! + a,(p — x) + a3(p —x)i+- .- (2.4) 


is valid, where a, > 0. 
It should be clear that p = In(1+ p) is the radius of convergence of the series 
P(x). For if |x| <p then 
2 3 


x 
le*-1|= dere ae ns 
2 6 


2 3 

<p+o +o + --+ =e'-1=9, 

with equality only if x =. Thus P(x) is analytic inside the circle |x| = p, and at 

all points on the circle except possibly for x=p. By expanding P(x) in a 

neighborhood of x = p it will be shown that p is, in fact, a singularity of P(x). 

This establishes p as the radius of convergence of P(x), and also as the sole 
singularity of P(x) on its circle of convergence. 

Asymptotic results can now be deduced by applying Stirling’s formula, since 

the coefficient of x” in (1—x)~* is just ['(s+n)/I(s) F(n+1) provided that 


s #0, 1, 2,.... By Stirling’s formula this ratio can be expressed as 
ns! s(s — 1) 
1+ +O ) z 
ait a FOO) (2.5) 


for n—. In (2.4), then, the term —a,(p —x)} contributes 
ee ee 3 1 
4g 117 *pin7 ip! (1 + oe + o(-3)) 
to P,. The next term, a3(p —x)}, contributes 
das pin-Hp-(1 + o(-)) 


when taken to the same order. The remaining terms collectively contribute 
O(n-ip-™), as can be deduced from Darboux’s Theorem (see Theorem 4 of 
Bender {1]) or from Pélya’s Lemma (see [9]). Adding these up, we have 
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where 
5\ 4 
ree *) 
ap =A 
and 
- 3 3439 
Bp=-+-=— 
a ae 


(2.6) 


To use the relation P(x) = P(e* — 1) requires that e* — 1 be substituted for x in 
our expression for P(x). Using 6 =e? — 1 we find that (6 — x) should therefore 
be replaced by 


p—(e*-1)=pt+1—-e? °-*) 
=p+1—(p+ lee) 
= (p+ If1- eo) 
=(1+ p)(p —x)[1-3(p —x) +--+]. 
In this way (2.4) yields 


P(x) =-a,(1 + p)'(p —x)h + {2a + p)i+a;(1 +a)ih(o —x}i+--- 
As before, (2.5) then gives 


B 
mera a(i+ 64 0(7)) 


where 
a (1+ p)'p} 
Age 2x} 
and 


3 3a3 es 
=o ——(1+ p)p. 
B, gt + p) +a, (1+ p)p 
Substituting for a, and a; as in (2.6) one finds 
1+ p)p\}- 
A,= (ee) A (2.7) 


P 
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and 


we, p 
Bp= a+ {(1 + 2)B,— 3} =. 


The transformation of T(x) to T(x) proceeds in the same way, although some of 
the constants are different due to the fact that the leading term for T(x) is of 
order (p — x)? rather than (p — x)}. The resulting relations are 


Ar= (Pea, (2.8) 


and 
= 15 4\ AR. — 15,2 
Br = 3 + {(1 + p)Br - 8): 
Here Az, By are the constants such that 


T, 
— p'ni =Ar(1 pore o(-)) 
n! n n 


and Ar, Br play the same role for T,. 

The transformation of R(x) to R(x) is exactly like that of P(x) to P(x) because 
the leading term for R(x) is of the same order, (p — x)}, as for P(x). Therefore 
equations (2.7) apply directly with the subscript P replaced by R throughout. 

The transformation of 5(x) to S(x) is different in certain constants because the 
leading term for S(x) is of order (9 —x)~}. The relations for S(x) which parallel 
(2.7) and (2.8) are 


As = (tbe) A, (2.9) 
and 


Bs= —§+ {(1+ p)Bs + 3} 


Dl 


It is understood that 


Sn nl B, 1 
ni? n= As(1 +7+0(3)): 


and similarly for As and Bs. 
Since yu, = R,,/T, and ft, = R,,/T,, we can apply (2.7) (as it applies to R,, instead 
of P,) and (2.8), which give 


Br a,(1+7#+0(S)) 
n n n 
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A, = (Qe Prey A, (2.10) 


and 
3 =\h 4 2\P 
B= — 2+ {0 +p). +2} 5. 


Of course A, and B, are the constants giving the first two terms of the asymptotic 
expansion for ji,/n. 

To evaluate 02 asymptotically it is now assumed that AsA, = Az, which is 
equivalent to the asymptotic condition 


4. +of) 


One can then work from the relation 


Guage + o(~) 
n 
where 
Fr -1 
a= (2 vee) A, +A3(1- 2). (2.11) 
p p 
3. Case 1 


The results of Section 2 can now be applied with P(x), T(x), R(x), and S(x) 
taken to be the counting series for Case 1 of [6 §2]. There 6 = 0.3102333374. .., 
so we have p = In(1 + 6) = 0.2702052403. In [6 §2] the values of A,, B,, Ay, Br, 
Ar, Br, As, Bs, A,, B,, and A, are all determined to 10 significant digits. At the 
same time the various assumptions made in Section 2 concerning the series and 
their asymptotic behavior are verified. Consequently it is simply a matter of 
numerical calculation to use the values obtained in [6 §2] as input to equations 
(2.7), (2.8), (2.9), (2.10), and (2.11) to obtain: 


Ap = 0.1707982096. . . , 
B, = 0.3181314364.. ., 


Enumerating phylogenetic trees 


where 
P,=n! pn tAp(1 eign o(5)) , 
n n 
Also 
Ar = 0.0723245687.. . , 
Br= 2.043867691..., 
where 
B 1 
T, =n! p*n$A(1 4—T4 o(=)) : 
n n 
Also 
Ar = 0.0968670732..., 
Br = 0.9170219933..., 
where 
1 
R, =n! p-"n-4Aa(1 qe o(5)). 
n n 
Also 
A, = 1.339339038 ..., 
B, = —1.117847390..., 
where 
B 1 
b= n(a, ++ O(;5)). 
Also 
As = 0.1297377923..., 
Bs = —0.6559227561..., 
where 
: B 1 
S, =n! pn tAs(1 ++ o(-)) 
n n 
Finally 
A, =0.5229767183..., 
where 


o2=nA, + O(1). 
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4. Case 2 


Once again the results of section 2 can be applied with P(x), T(x), R(x), and 
S(x) taken to be the counting series for Case 2 of [6§3]. There p= 
0.2367771669..., so we have p=In(1 + p) =0.212508937.... In [6 §3] the 
values of Ap, Bp, Ar, Br, Arg, Br, As, Bs, A,, B,, and A, are all determined to 
10 significant digits. As in the previous section, the assumptions concerning the 
series and their asymptotic behavior are verified. Hence we used the values 
obtained in [6 §3] as input to equations (2.7), (2.8), (2.9), (2.10), and (2.11) to 
obtain: 


Sie ds B 1 
P,=n!p7"n :4,(1+="+ 0(-3)) 


where 

Ap = 0.1928169230. .., 

Bp = 0.5090632210..., 
Also 

T= ni p-*ntA(1 + ory o()) 
where 

Ar = 0.0794186525..., 

By = 2.259683002..., 
Also 

3 B 1 

R,, =n! p'n3Aq(1 + rn o(-)) 
where 

Ag = 0.1093548242..., 

Br = 1.047014483..., 
Also 

Hn = n( Ay ++ o()) 

n 

where 

A, = 1.376941321..., 

B, = —1.212668519..., 
Also 


= B 1 
S,=nip"n tAs(1 ae o(-)) 
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where 
As = 0.1505751762..., 
Bs = —0.5660531466. .., 
Finally 
o7,= nA, + O(1) 
where 


Ay = 0.3670613002... . 
§. Summary and conclusions 


In this paper we have completed our analysis of the enumeration of the twelve 
classes of phylogenies. The two final cases which we studied here both allow 
multiple labelling of vertices. They are: (i) no vertices of degree two and interior 
vertices may possibly be labelled, and (ii) there are no restrictions on vertex 
degree but only vertices of degree one or two can be labelled. The exact and 
asymptotic number of phylogenies with given magnitude, along with the mean 
and variance of their order was also determined. 

The method used to derive cases 1 and 2 from the previously analyzed 
singleton-label cases is a general one, which could have been applied to the other 
multiple-label classes of phylogenetic trees analysed in previous papers by the 
authors. It is in the nature of the general transformation that a linearly growing 
variance of the order in a singleton-label case implies the same behavior of the 
variance of the ordei in the corresponding multiple-label case. In fact the 
variance of the order is indeed linear in each class studied. 
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The triad counts of a graph G are the numbers of various distinct induced subgraphs of order 3. 
If G is an undirected graph, there are 4 triad counts, and if G is a digraph, there are 16 triad 
counts. A multigraph is a sequence G =(G,,...,G,) of graphs and digraphs defined on a 
common vertex set. The concept of triad counts is generalized to multigraphs with colored 
vertices, edges, and arcs. It is shown how triad counts in multigraphs can be used in various 
kinds of statistical analyses of graph data. In particular, probability distributions are 
investigated of the triad counts in random multigraphs. 


1. Introduction 


A dyad of a graph of digraph is an induced subgraph of order 2, and a triad is 
an induced subgraph of order 3. For a graph or digraph G of order n there are (3) 
dyads and (3) triads that can be partitioned into different equivalence classes 
according to isomorphism. If G is a graph, there are at most 4 nonisomorphic 
triads (Fig. 1), and if G is a digraph, there are at most 16 nonisomorphic triads 
(Fig. 2). The frequencies of isomorphic dyads and triads in G are called the dyad 
and triad counts of G. 

In statistical applications of random graph models the dyad and triad counts 
have been found useful both as summary statistics for exploratory analyses and as 
sufficient statistics for inference on particular random graph models. Some further 
comments and references on the use of dyad and triad counts in statistics are 
given in the next section. This shows that there is a need for extending the 
concept of dyad and triad counts to multigraphs G=(G,,..., G,), that is to 
ordered sequences of graphs and digraphs on a common vertex set. Various other 
extensions such as signed graphs, valued graphs and colored graphs are also 
discussed. Section 3 gives formulae for the numbers of nonisomorphic dyads and 
triads that exist for colored multigraphs. Section 4 discusses properties of the 
triad counts and some open extremal graph problems of interest in this 
connection. Section 5 gives some probabilistic properties of the triad counts for a 
random graph model that can be used for colored multigraphs. 


2. Statistical applications 


Data to be modelled by random graphs can be obtained from social contact 
networks, paired comparison experiments, political dominance patterns, unreli- 
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able component systems, and so forth. Often there is more than a single binary 
relationship to be investigated, and often there are variables defined both on the 
objects and on the pairs of objects involved. For instance, the vertices of a graph 
model can be people with vertex variables gender, age, income, etc. and edges 
and arcs for contacts having contact variables duration, purposes, gain, etc. 
Exploratory statistical analyses of graph data can be based on summary statistics 
like the triad counts. If different values on vertex, edge, and arc variables are 
represented by colors we can speak of triad counts in colored multigraphs. For 
instance, 4 arc colors are needed to represent the occurrences and non- 
occurrences of two different kinds of relationship, and 3 arc colors are needed to 
represent a single relationship that might be strong, weak, or absent. 

Several authors have used triad counts to investigate structural properties of 
empirical networks and to estimate and test particular random graph models. The 
16 triad counts in digraphs have been extensively studied in social science 
applications. See for instance Holland and Leinhardt [15, 16] and other articles in 
the same proceedings volumes. Transitivity indices defined in terms of triad 
counts were considered by Frank [6], Frank and Harary [10] and Holland and 
Leinhardt [13, 14, 15]. Frank [3,4] used triad counts to study transitivity and 
clustering properties of graphs. Frank and Harary [8] used triad counts in signed 
graphs to investigate structural balance. Frank and Strauss [11] proved that the 
triad counts are sufficient statistics for a particular class of Markov random graph 
models. 

We can distinguish between two different main uses of triad counts in statistical 
analysis. The triad counts can be used for estimating and testing particular 
random graph models, for instance estimating a Markov model or testing a pure 
randomness model versus a Markov model. A pure randomness model can here 
be a Bernoulli graph or a dyad independence model of the type that is described 
in Section 5. 

Triad counts can also be used in exploring and modelling graph data, for 
instance by using normalized triad counts as regressors for predicting graph 
properties or as discriminators for graph classification. By normalized triad counts 
is here meant observed triad counts divided by their estimated expected values 
according to some random graph model. Several examples of these uses can be 
found in the references given above. 


3. Numbers of dyads and triads 


The appropriate choice and scaling of variables for multivariate graph data can 
be partly guided by considering the effects on the numbers of dyads and triads, 
that is on the number of statistics to be used in the subsequent analysis. In order 
to determine these numbers in general we introduce a multigraph with colored 
vertices, edges and arcs. 
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Theorem 1. For multigraphs with a vertex colors, b edge colors and c arc colors 
there are (“3 ')b nonisomorphic dyads and 


be? + 
("5° *)-ere(3) 
nonisomorphic triads. 


Proof. Burnside’s lemma (see for instance Harary [12], p. 181) can be applied to 
count the numbers of nonisomorphic dyads and triads. The number of nonisomor- 
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phic dyads is given by the average number of fixpoints of the vertex permutations 
applied to all a*bc? labeled dyads that can be formed with the colors. For the 
identity permutation all labeled dyads are fix-points, and for the switch 
permutation there are abc fixpoints. Thus the number of nonisomorphic dyads is 
given by 4(abc* + abc) = (“'')b. For triads there are six vertex permutations: 
the identity, three switches, and two rotations. The numbers of fixpoints are 
found to be a°b°c® for the identity, a2b?c* for any switch, and abc? for any 
rotation. Thus the number of nonisomorphic triads is given by the average 
number of fixpoints 


(a°b°c° + ure + 2abe*) _ ae ’) _ abe‘), : 


By putting c=1 and b=1, respectively, we obtain the following two 
corollaries. 


Corollary 1.1. For graphs with a vertex colors and b edge colors there are (*3')b 
distinct dyads and (?°3'*) distinct triads. 


Corollary 1.2. For digraphs with a vertex colors and c arc colors there are (“7 ') 
distinct dyads and (“';"*) — a?c?(§) distinct triads. 


By putting a = 1, b = 2* and c = 24 we obtain the following corollary. 


Corollary 1.3. For multigraphs consisting of g graphs and d digraphs there are 
(24 + 1)24*8-" distinct dyads and ("5 *') +24(7"°5*") distinct triads. 


In particular, there are 20 triads (Fig. 3) for multigraphs consisting of two 
graphs, and there are 104 triads for multigraphs consisting of a graph and a 
digraph. 


4. Dyad and triad counts 


Consider colored multigraphs with a vertex colors, b edge colors and c arc 
colors. Denote the dyad counts of a colored multigraph G by (7, ..., ™), where 
k +1 is the number of nonisomorphic dyads, and these dyads have been labeled 
according to some fixed order. The triad counts of G are denoted by (fo, . .. , t;), 
where / + 1 is the number of nonisomorphic triads also labeled according to some 
fixed order. If G has order n, then 


Sn=(3) Zo=(5) 
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The dyad counts can be obtained from the triad counts. In fact, if 7,; is the number 
of i-dyads in the j-triad, then 


i 
> rt=(n-2)n, §=0,...,k. 
j=0 


Various properties of G can be expressed in terms of these counts. For 
instance, for a single graph G take r; and t; as the numbers of dyads of size i for 
i=0, 1 and triads of size j for j =0, 1, 2,3. If G has order n, size r, degrees 
d,,..., 4, and s two-paths, then by using that 


n d: 
s=t, 4+ 3t,= >, ( ' 
int \2 
we easily find that the mean and variance of the degrees can be obtained from the 


triad counts according to 


iw 2(t, + 2t2 + 343) 
24a 


i=1 n(n = 2) ‘ 
2(t2 + 3t3) 
ar ed . 


go 
n 
~ 3 (d,- d= d(1-d)+ 


A problem of general interest to statistical applications is to find upper and 
lower bounds to the triad counts of colored multigraphs G with specified dyad 
counts. For instance we easily find that t;<(n — 2)4r for graphs of order n and 
size r. Can better bounds be found if the graph has special properties like being 
planar, connected, transitive, and so forth? Frank [7] considered triads in planar 
graphs. For instance, it holds that ; < (n — 2)? for planar graphs of order n. For 
digraphs of odd order n, there cannot be more than }("3") 3-cycles. Frank and 
Harary [9] considered maximum triad counts in graphs and digraphs of order n, 
and Bollobdas [1] has a lot of material on related issues. For instance, for graphs 
of order n and size r 


max f3 = (2 dale () 


2 


where uv is given by (3) <r<("3'). A lower bound to #; for r= 4n? is given by 
r(4r —n”)/3n, and a lower bound to f, for 4n?<r <4n? is given by n(4r — n?)/9 
(Bollobds [1], Corollary 1.6 p.297 and Corollary 1.9 p.301). For n—© and 
r/(3)— p where } <p <4 we obtain that 


2(2p — 1)/3< s/(3) <p°”. 


A limiting version of interest of the problem for colored multigraphs is to let 
n—c so that the relative dyad counts tend to specified limits r;/($)— p; for 
i=0,..., k and ask for bounds to the relative triad counts ¢;/(3) for j=0,..., 2. 
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5. Random colored multigraphs 


In random graph theory the most common models are the uniform random 
graph of fixed size and the Bernoulli graph on a fixed finite vertex set, say 
{n]={1,...,}. A uniform random graph of size r and order n is a random 
graph that assigns the same probability (7)~' to each graph of size r on [n]. A 
Bernoulli graph of order n with edge probability p is a random graph that assigns 
probability 


p(l— py?" 
to each graph of size r on [n] for r=0,..., (5). These models are treated 
extensively in the recent monographs by Bollobds [2] and Palmer [17]. 

A natural extension of the Bernoulli-graph to digraphs is the dyad independ- 
ence model considered by Frank [6]. According to this model the (3) random 
dyads are independent identically distributed with a probability distribution that 
is invariant under isomorphism. Thus, the random digraph distribution is 
specified by the probabilities assigned to the dyads of size i for i = 0, 1, 2. 

For a colored multigraph with no vertex colors we can define a similar general 
dyad independence model by specifying the probabilities p; assigned to the 
nonisomorphic dyads i=0,...,k. Here p;20 and pot+---+p,=1. Each 


labeled colored multigraph on [n] with dyad counts (7,...,7%) is assigned a 
probability 
k 
IT pi. 
i=0 
Let (Ro, ..., R,) be the random dyad counts and (%,..., 7;) the random 
triad counts of a general dyad independence model. Now (R,,..., Rx) is 
multinomially distributed with parameters (3) and (p;,..., p,). The distribution 


of the triad counts is not easy to determine. However, the expected values and 
the variances and covariances can be obtained by generalizing the methods used 
by Frank [5]. We have the following result. 


Theorem 2. For a random colored multigraph of order n the triad counts 
(To, ..., Tj) have expected values ET; = (3)P; and covariances 


Cov(T, 7) = (3) P (8; B) + 12(7)(P, - BP) 


where P, is the probability that a random triad is isomorphic to the i-triad, P, is the 
probability that two random triads with two vertices in common are isomorphic to 
the i-triad and the j-triad, resp., and 6,; is 1 or 0 according to whether or not i =}. 


In order to determine P, it is convenient to use that 


3! 


= ——__—__ pp... 
f=, {PO + PR'W; 
Tj: 7°" * Ves 
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Tabte 1. Triad probabilities of digraphs. 
(Dyads and triads labeled according to Fig. 2.) 


Triad Dyad counts Proportion Probability 
i Tol yi? Labelings =; P, 
0 300 1 1 Po 
t 210 6 1 3p2p, 
2 201 3 1 3poP2 
3 120 6 } 3popi/ 2 
4 120 3 1 3pop3/4 
5 120 3 7 3popil4 
6 030 6 3 3p?/4 
8 ttt 6 A 3PoP1P2 
9 it 6 S 3PoP 1P2 
10 021 3 i 3p? p,/4 
tt 021 3 ; 3p p2/4 
12 021 6 i 3p p2/2 
13 102 3 1 3pop> 
14 O12 6 1 3p,p2 
15 003 1 1 P3 


Table 2. Triad probabilities of multigraphs consisting of two 
graphs. (Dyads and triads labeled according to Fig. 3.) 


Triad Dyad counts Proportion Probability 
i Toshio? Labelings , P 
0 3000 1 1 Pe 
t 2100 3 1 3pip, 
2 2010 3 1 3pop2 
3 2001 3 1 3pop3 
4 1200 3 ‘t 3pop? 
5 1110 6 l 6PoP1P2 
6 1020 3 1 3pop2 
7 1011 6 1 6poP2P3 
8 0210 3 1 3pi p2 
9 0030 1 1 p> 
10 0300 1 1 Pi 
tt 0120 3 1 3p, pz 
12 1101 6 1 6PoP1P3 
13 0201 3 1 3pips 
14 Otit 6 1 6P1 P2P3 
15 0021 3 1 3p3p3 
16 1002 3 1 3poP3 
17 0102 3 1 3p, P3 
18 0012 3 1 3p2P3 
19 0003 1 t P3 
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where (7o;, . .. , %) are the dyad counts of the i-triad, and w, is the proportion of 
triads isomorphic to the i-triad among the triads which have the same dyad counts 
as the i-triad. In particular, w;=1 if the i-triad is uniquely specified by its dyad 
counts. Otherwise there are several non-isomorphic triads which have the same 
dyad counts as the i-triad. For each such triad we find how many isomorphic 
labelings there are and then obtain w; as the proportion of labelings for the 
i-triad. 

Consider for example a single digraph. Dyads and triads are given in Fig. 2. 
The dyad counts of the triads, the numbers of isomorphic labelings, and the 
probabilities are given in Table 1. Table 2 gives corresponding quantities for a 
multigraph consisting of two graphs. We note that when c = 1 all nonisomorphic 
triads are uniquely specified by their dyad counts. 

The main use of Theorem 2 in exploratory statistics is that it shows how the 
triad counts can be normalized to define regressors and discriminators. We 
normalize according to X; = T/(3)B where P is obtained from P, by substituting 
maximum likelihood estimators for the dyad probabilities, that is p; is replaced by 
Bp; = R,/() for j =0,..., k. The probabilistic properties of (X,,..., X;) are not 
well known, 
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In this paper we give an algorithm for generating all tournament score sequences of a given 
length in lexicographic order. We also show how to construct a tournament with a given score 
sequence. 


1. Introduction 


A tournament of order n is a loopless digraph T=(V,A), where V= 
{1, 2,...,m}, such that there exists exactly one arc between any pair of distinct 
vertices i and j. The score of a vertex i, denoted by s7(i) (or simply s(é)), is 
defined as the outdegree of i. Unless otherwise specified, we shall assume that 
s(1) <s(2)<--+-<s(n). The sequence (s(i)) = (s(1), s(2),...,8(n)) is called 
the score sequence of T. 

To be able to enumerate all score sequences of a given length, one must first 
know a necessary and sufficient condition for a non-decreasing sequence of 
integers to be a score sequence. Such a condition is given in a characterization 
theorem proved by Landau [4] in 1953. 


Theorem 1 [4]. A non-decreasing sequence of n integers {s(i)) is a score sequence 
if and only if 


Sa0-(" 


i=1 


for each k =1,2,...,n with equality for k =n. 


We shall find an equivalent characterization of score sequences which is very 
helpful in the lexicographic enumeration algorithm established in this paper. 

Let (s(i)) be any sequence of integers. We define the (transitive) deviation 
sequence of {s(i)) to be the sequence (d(i)) = (s(i) —i+1). We shall call d(é) 
the deviation of s(i). It is easy to see that (s(i)) is non-decreasing if and only if 
d(i)-—d(i+1)<1 for each i<n. It is also easy to check that for each 
k=1,2,...,n, DE, di) =ZE,s(i)—(§). From this, we see that a non- 
decreasing sequence of n integers (s(i)) is a score sequence if and only if its 
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deviation sequence (d(i)) satisfies DX, d(i) =0 for k =1, 2,..., with equality 
for k =n. 


2. Lexicographic enumeration of score sequences 


Let (s(i)) and (s'(i)) be score sequences of length n. We say that (s(/)) 
precedes (s'(i)) if there exists a positive integer k <n such that s(i) =s’(é) for 
each 1<i<k and s(k)<s‘(k). (They are equal if equality holds for all i.) In 
symbols we shall write (s(i)) < (s’(i)) if (s(i)) precedes (s‘(i)). We say that 
(s'(i)) is the successor of (s(i)) if they are distinct, (s(i)) <(s‘(i)), and 
(s'(i)) < (s"(i)) whenever (s(i)) < (s"(i)). 

An enumeration of all score sequences of a given length with the property that 
the successor of any score sequence follows it immediately in the list is called a 
lexicographic enumeration. 

Let us take note of two important facts. First, the score sequence 
(0, 1, 2,...,—1) is not the successor of any score sequence of length n and 
hence it is always the first in the lexicographic enumeration. Second, (s(i)) has 
no successor if and only if s(m) — s(1) $1. 

If we know the first sequence in a lexicographic enumeration, then we can 
complete the work provided we know how to get the successor of any given 
sequence. 

The successor (s'‘(i)) of (s(i)), if there is any, can be obtained as follows. 

(1) Determine the maximum k such that s(n) — s(k) 2 2. 

(2) Let s'(i) = s(é) for all i, i<k. 

(3) Let s'(k) = s(k) +1. 

(4) Let s’(j) =s(k) +1 until D4. s'(i) < (4). 

(5) Let ¢ be the minimum j such that ¥_,s'(i) < (4); 

Set s'(t)= (3) — Lizis’(i). 


(6) Let s'(é)=i—1 for all i, t<i<n. 


3. Constructing a tournament with a given score sequence 


One method of constructing a tournament with a given score sequence can be 


found in [1]. Here we shall give another method of construction and prove its 
validity. 


Lemma 1. Let (s(i)) be a score sequence of length n with deviation sequence 
(d(i)). 
(a) If max{d(i)} = M>0, then for each 1<k <M, there exists a vertex x such 
that d(x) =k. 
(b) If min{d(i)} =m <0, then for each -12k =m, there exists a vertex x such 
that d(x) =k. 
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Proof of (a). Certainly, (a) is true for k = M. We shall prove that (a) is true for 
all 1<k <M by (backwards) induction on k. Let 1<k <M and assume that (a) 
holds for k + 1, i.e. there exists a vertex x such that d(x) =k + 1. Since d(n) <0, 
there exists ¢>x such that d(x)=d(t)>d(t+1). Since d(t)<d(t+1)+1, we 
have d(t+1)=d(x)—1=k. Hence (a) holds. Part (b) can be proven using a 
similar kind of argument. O 


Lemma 2. Let (s(i)) be a score sequence of length n with deviation sequence 
(d(i)). If c is the number of negative terms, then c = max{d(i)}. 


Proof. Let p = max{d(i)). If p=0, then c=0 by Theorem 2 and the Lemma 
holds. If p > 0, we consider the following cases. 


Case 1. There exists a vertex k such that d(k) <0 and |d(k)| =p. 
By Lemma 1, c > |d(k)| =p. 
Case 2. For each negative deviation d(k), |d(k)| <p. 


Let q = max{|d(i)}|:d(i) <0} and suppose that c<p. Then using Lemma 1, 
La<old()|<1+2+---+q+(p-—q)q. But by Theorem 1, Yayy<old(i)| = 
Yaiy>04(i) and by Lemma 1, Yagy>od(i)=1+2+---+p. Hence, q(q + 1)/ 
2+(p—q)q>p(p+1)/2. This gives the quadratic inequality p?— (2g — 1)p + 
q(q — 1) <0 which implies that g-— 1<p<gq. This is absurd since p and q are 
integers. Hence,c2p. O 


We are now prepared to describe and validate an algorithm for constructing a 
tournament with a given score sequence, which is a main result of this paper. 


Construction algorithm. Let (s(i)) be a score sequence of length n with deviation 
sequence (d(i)). First we take n vertices arranged horizontally and labelled 
1, 2,..., from left to right. 


Step 1. Subdivide (d(i)) into maxima! non-increasing segments and denote by p 
the number of segments in the subdivision. Let n,; be the number of negative 
deviations in the ith segment (counting from left to right). 


Step 2. Let j be the least integer such that d(j) >0. If no such j exists, go to Step 
6. Else, determine the least integer q such that U,<,n;2=4(j). For each i in the 
segments to the left of the qth segment such that d(i) <0, let d’(é) = d(é) + 1 and 
draw the arc ji. 


Step 3. Let o=d(j)— Uicgn; and choose o smallest (negatively largest) devia- 
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tions d(i) in the qth segment. For each such d(é), let d’(i) = d(é) + 1 and draw the 
arc ji. Let d'(j) =0. 


Step 4. For all other deviations d(i) not changed in the preceding steps, let 
d'(i) = d(i). 
Step 5. If (d'(i)) # (0), go to Step 1 using (d’(i)) in place of (d(i)). 


Step 6. Whenever u < v and there is no arc between u and v, draw the arc vu. 


Step 7. The resulting digraph is a tournament with score sequence (d(i)). 


Let us now analyze Algorithm 3 to see its validity. Clearly, Step 1 can always 
be carried out. Step 2 can be done in view of Lemma 2. Step 3 can be 
implemented because of Step 2. We can obviously do Step 4. Furthermore, we 
see that after this step, (d'()) satisfies d'(i) — d'(i + 1) <1 for all 1<i<n. Now, 
what happens when we reach the stage (d'(i)) = (0)? Let G' be the digraph 
formed at this stage. Then it is easily seen that for each vertex i in G’, we have 


._ fd(i), ifd()>0 
oda(i)= {5 if d(i) <0 
eeree Lue if d(i) =0 
agli= (ee if d(i) <0 


<i> a 

<d¢i)> : =} =} 

<d°¢id>: O {e) {e) - - _ -1 -1 
Q Q 2 i ce] ce] ie) {e) 9 -} -1 =i 
QO ce) 9 } Q Q ie] (0) 9 QO {e) =1 
Q Qo oO Q 9) fe) ie) Q ie) 


Fig. 1. The above digraph is the result after using Algorithm 3 up to the stage where (d‘(‘)) = (0). 
We complete it into a tournament by adding all the arcs ij, (i >/), between two vertices i and j which 
are not yet jointed by any arc. 
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Let T be the tournament formed after Step 6, and let i be any vertex of T. If 
d(i)=0, then we have s(i) = od(i) = od,(i) +i-—1=s(i). If d(@@) <0, then we 
have id(i)=id,({i)+n—i=-—d(i)+n~-i and hence s(i)=od(i)=(n—-1)- 
id(i) = d(i) + i-1=s(i). Therefore, the score sequence of T is (s(i)). 


Example. Construct a tournament with score sequence {s(i)) = (3, 4, 4, 4, 4, 4, 
6, 6, 6, 6, 9, 10). 

We have (d(i)) = (3, 3, 2, 1, 0, —1, 0, -1, -2, —3, -—1, —1). Using the 
algorithm up to the point when (d’(i)) = (0), we obtain the graph G’ shown in 
Fig. 1. (The maximal non-increasing segments of each (d‘(i)) are underlined.) 
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Problems on chain partitions 


Recently I have come across several fundamental problems concerning the 
existence of partitions of posets into chains satisfying certain conditions. Other 
problems of this sort have been open and frustrating researchers for years. I have 
collected these problems here, together with a few of my own, to give them wider 
exposure. The problems can be easily stated for a broad mathematical audience. 
It is likely that their solutions require techniques and insights which differ greatly 
from those normally employed in the contexts in which the problems were 
initially developed. 

The presentation of the problems follows a review of the necessary notation 
and terminology. We first discuss problems about the poset of subsets of a finite 
set, and then we give problems for other families of posets. 


Notation and terminology 


Throughout we consider finite posets P = (P, <). A totally ordered subset of 
P, say C={x,,...,x,}oP with x,;<x,<---<-x,, is called a chain. Such a 
chain is saturated (also called unrefinable or consecutive) if for all §=2, x;,, 
covers x;, that is, x;<y <.x,,, for y in P only if y =x,. An antichain is a totally 
unordered subset of P. The width of P, denoted d,(P), is the maximum size of an 
antichain in P. Let A be a maximum<-sized antichain, and let C={C,,..., C,} 
be a partition of P into chains C;. Since any chain C; intersects the antichain A at 
most once, it follows that 


|C| =s > |A| =4,(P). 


A theorem of Dilworth [5] states that this bound on the number of chains in a 
chain partition is best-possible: There exists C such that |C| = d,(P). 

We next recall an important generalization of Dilworth’s Theorem. Given 
k =1, a subset F of P is a k-family if it can be expressed as the union of at most k 
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antichains (or, equivalently, if |F 7 C|<k for every chain Cc P). Let d,(P)= 
max{|F|:F cP a k-family}. A chain partition C={C,,...,C,} induces an 
upper bound on the size of a k-family F in the following way: 


IFl=> IFN Cl <> min(k, |C;)). 


We denote this last sum by d(k, C). By taking F to be a maximum-sized k-family, 
we obtain d(k, C) =d,(P). If d(k, C)=4,(P), C is said to be k-saturated. The 
Greene-Kleitman theorem [8] states that for all k and P there exists a k-saturated 
chain partition C. Dilworth’s Theorem is the case k = 1. 

All posets P we consider are graded which means that every maximal chain C 
in P has the same size. The rank r(x) of an element x in a graded poset P is one 
less than the maximum size of the chains which contain x as the top element. The 
rank of P, r(P), is the maximum rank of any element in P. Let P= {xe 
P:r(x) =i}. This partitions P into antichains, Py,..., P,, where n =r(P). 

The sequence of Whitney numbers of a graded poset P is (Wo, W,,..., W,) 
where W,=|P;| and n=r(P). P is rank-symmetric if W,=W,_, for all i, and 
rank-unimodal if for some j, Wo < W, <--- <W, and W,>W,,,2--->W,. 

The union of the k largest rank-sets P, of P is a k-family so d,(P) is at least the 
sum of the k largest Whitney numbers W,. P has the strong Sperner property if for 
all k, d,(P) actually equals this sum of the k largest Whitney numbers. A Peck 
poset is a rank-symmetric, rank-unimodal poset with the strong Sperner property 
(cf. survey [13]). 

If a graded poset P of rank n has a partition C into chains such that each chain 
in C is saturated and symmetric about middle rank, 4n, then P is called a 
symmetric chain order. This means that for each chain C in C there exists an i 
such that C consists of one element of each rank i,i+1,...,n—é Computing 
the bound d(k, C), we find that d(k, C)=4d,(P) is the sum of the k middle 
Whitney numbers, so that P is a Peck poset. We also conclude that the partition 
C is completely saturated, which means it is k-saturated for all k. 


Problems concerning subsets 


Let [n]={1,...,”}. Let B, =(2!"!, c) be the Boolean algebra of order n, 
which is the poset of all subsets of [n], ordered by inclusion. Sperner’s Theorem 
{22] states that B, has width (\j,1), the size of the middle rank(s). A stronger 
result, due to deBruijn et al. [4], is that B, is a symmetric chain order (cf. 
{10, 11]). 


1 (Z. Firedi [7]). Can B,, be partitioned into (,j,,) chains of the same size (within 
one)? That is, letting a and 5 satisfy 2” =a(\y,;)+6, where 0<b <(ig,)), we 
require b chains of size a + 1 and (,4,;) — 5 of size a. This is the minimum number 
of chains which can cover B,, in light of Sperner’s Theorem. This problem seems 
to be hard. 
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2 (B. Sands [21]). Can B,, be partitioned into chains of size 4 for sufficiently large 
n? More generally, for any k, can B, be partitioned into chains of size 2“ for all 
sufficiently large n, given k? 

In this question note that 2* divides |B,| = 2” for n =k, so a complete partition 
is possible. For k = 1 it is trivial to partition B, for n>1, e.g. take the chains 
{X, XU{n}} for Xc{n-—1]. For k=2, Griggs et al. [26] recently solved 
the original problem by showing that B, can be partitioned into chains of size 4 if 
and only if n =9. It is impossible for n <8 since then the number of chains, 2”~°, 
is less than (j\g,;). For n =9 they have an ad hoc construction, which can be 
extended automatically by induction to all n >9. However, the method does not 
extend to the general k problem. Of course it would be very nice if B,, could be 
partitioned into chains of size 2" if and only if the number of chains 2"~* < (jj). 

We propose a stronger conjecture which involves chain sizes other than powers 
of 2. Given c= 1, can B,, be partitioned into chains of size c, except for at most 
c—1 elements, which also belong to a single chain, for n >no(c)? This is trivial 
for c = 1 and is true for c = 2, 4 by the remarks above. For c = 3 it turns out to be 
true for all n. 


3, Problems 1 and 2 seem promising particularly because Sperner theory is 
consitent with them. The strongest conjecture about chain sizes in partitions of B,, 
that is consistent with Sperner theory is this: Let A=(A,=A,=---) be any 
partition of the integer 2” into parts A;=0. Let o=(0,20,2:--) be the 
partition of 2” corresponding to the sizes of the chains in the symmetric chain 
decomposition of B,,. Thus, 


O,=n+t1, 02=°°=Oy=n-1, Omer =" = Oy =n - 3, 


and so on. A term go; >0 if and only if i <(4f,)). 
The conjecture is that there is a partition of B, with chain sizes A if and only if 
o =A in the majorization order, that is, ) 0; = A; and for all; 


5 0; > > hi. 
i=1 f=1 

The “only if” direction is a consequence of the symmetric chain decomposition 
being completely saturated. We have only checked the converse direction for very 
small cases, n <4. The analogous conjecture for arbitrary posets is easily seen to 
be false, so this conjecture may be overly optimistic. 


4. Determine partitions 4 of 2” such that B, can be partitioned into saturated 
chains with sizes 4. The symmetric chain decomposition described by o above 
consists of saturated chains, but the conjecture for saturated chains analogous to 
problem 3 is false: There exists A<o not realizable by a partition of B, into 
saturated chains. 
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We recently obtained a partial answer to this problem {14]: Let G(n, c, k) = 
Y {(}):i=k (mod c)}. If B, is partitioned into saturated chains of size at most c 
(i.e. if A, <c), then we show that 

(i) the number of chains of size c is at most G(n, c, [4(n +c)]) (i.€.-4; <c for 

i>G(n, c, |4(n+c)]), and 

(ii) the total number of chains is at least G(n,c, |4n]) (i.e. A;>0 for 

i=G(n,c, [4n]). 

Obtaining these bounds is easy, but it is interesting that both bounds are 
best-possible. 


Problems for other posets 


5 (R. Stanley [23, p. 182]). Let L(m, n) denote the lattice of Ferrers diagrams 
fitting into an m X n rectangle, ordered by inclusion. Equivalently, L(m, n) is the 
poset of all integer sequences § = (0<a,<---<a,,<n) ordered by S <S" if and 
only if a;<a; for 1<i<m. Is L(m,n) a symmetric chain order? This was first 
shown for m<4 and all n by Riess {20] and later rediscovered for m=3 by 
Lindstrém [17] and for m=4 by West [25]. Stanley proved in general that 
L(m, n) is Peck (cf. [24, 15, 19] for later, more elementary proofs). Stanley {23] 
studied more generally a class of posets related to the Bruhat order of Weyl 
groups. He showed that these posets are Peck and asked whether they are 
symmetric chain orders. 


6 (A. Bjdrner {3, p. 189]). Is the weak ordering of the symmetric group, S,, a 
symmetric chain order? In this ordering, a permutation a = (a;,...,a,) covers 
b=(b,,...,6,) if @ is obtained from 6b by transposing some adjacent pair 
b,, 6:4, in b with b; <b;,,. For instance (1, 4,2, 3) covers (1, 2,4, 3) in S,. The 
minimum element is thus (1, 2,...,), the maximum element is (n,n —1, 
n-2,..., 1), and the rank of this ordering of S, is (n(n — 1)). This ordering is 
rank-symmetric and rank-unimodal, but it is open whether it even has the 
Sperner property. The same questions are posed more generally for the weak 
ordering of a Coxeter group (W, S) with W finite [3]. 


7 (Folklore). Let L,(q) denote the lattice of subspaces of an n-dimensional 
vector space V over the finite field GF(q), ordered by inclusion. Then a subspace 
S of V has rank equal to its dimension. L,(q) is known to be a symmetric chain 
order [1, 12] by an existence proof which exploits the regularity of the lattice and 
combinatorial matching theory. The problem is to give an explicit symmetric 
chain decomposition analogous to the explicit ones for the Boolean lattice and the 
lattice of divisors of an integer. Such a result may require a nice method of 
describing the subspaces which would be helpful for other problems about L,(q), 
e.g. proving a subspace analogue of the Kruskal—Katona theorem. 
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8 (K. Engel [6]). An ordered partition C of a poset P into chains C,,..., C, is 
said to be admissible if |C;| <3 and if whenever two elements x and y belong to CG 
and C; has at least two elements above x and below y, there is some element z in 
C,U---UC;)_, such that x <<z<y. The problem is to show that the product of 
three chains with equal length has an admissible ordered symmetric chain 
decomposition. This poset is known to be a symmetric chain order [4], so the 
problem is to find a decomposition which can be ordered appropriately. Engel 
found an admissible decomposition for the product of just two chains of equal 
length using a lovely zigzag construction. This result extends to every product of 
an even number of equal length chains by a product theorem for admissible 
partitions. To extend this to any product of at least two chains of equal length it 
suffices, by the product theorem, to solve the case of just three chains. This 
cannot follow simply from the product theorem since a single chain of size at least 
4 is not admissible. Supporting evidence for the conjecture is the discovery by 
Mahnke [18] of admissible decompositions when the chain size is at most 5. 

If the desired result is true, there would be a nice application to computing the 
minimum number of evaluations required to completely determine an unknown 
order-preserving map f :P— Q when P, Q are finite posets and P is a product of 
chains of equal length. 


9 (Griggs [12]). A finite ranked poset P has the LYM property if for all k > 0 and 
all subsets A c P,, the set 0A of elements of P,_, covered by some element of A 
satisfies 

|9Al IAL 

[Pe-al [Pel 


One class of LYM posets is the regular posets, which have the property that for 
all k, every element of P, covers the same number a, of elements of P,_, and is 
covered by the same number f, of elements of P,,,. Anderson {1] and, 
independently, Griggs [12] proved that every rank-symmetric, rank-unimodal 
LYM poset is a symmetric chain order. The problem is to say something about 
LYM posets in general. Specifically, Griggs has conjectured since 1975 that LYM 
posets have completely saturated partitions. Since LYM posets P are known to be 
strong Sperner {10], this says equivalently that there is a partition C of P into 
chains such that whenever a chain C in C contains an element of any rank P, it 
also contains an element of each rank P; such that |P| > |P;|. It might be easier to 
prove a considerably weaker result, e.g. that a rank-unimodal regular poset has 
such a completely saturated chain partition. However, no progress has been made 
on this problem since its formulation. 
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Let G be a graph and x9, x, be two different vertices of G. Two players, Black and White, 
mark alternately non marked edges of G. White loses if and only if he marks all edges of a path 
connecting x) and x,. This game is the misére version of the well-known Shannon Switching 
Game. We give its classification as a particular case of the classification of a more general game 
played on a matroid. 


1. Introduction 


Let M be a matroid and ee E(M). Two players, Black and White, mark 
alternately non marked elements of E(M) — e. In the ordinary Shannon Switching 
Game, White wins if he marks all elements of a circuit broken at e; otherwise 
Black wins. We recall that a circuit broken at e is a set of the form C — e, where C 
is a circuit of M containing e. We define the Misére Switching Shannon Game on 
M with respect to e as follows. The rules are the same as above except that White 
loses if he marks all elements of a circuit broken at e and wins otherwise. 

The Shannon Switching Game was introduced by Shannon for graphs and 
generalized to matroids by Lehman in [4]. Lehman has given a complete 
classification of this game [4]. 

The Misére Shannon Switching Game is different from the misére game 
considered by Kano for graphs [5] and generalized to matroids in [3], where 
White loses if and only if he marks a circuit of M. 

A matroid M is called a block if E(M) is the union of two disjoint bases. A 
block of M is a subset X c E such that the induced matroid M(X) is a block. 


Theorem 1A (Lehman [5]). Let M be a block. Then White playing second can 
mark a base of M. 


Theorem 1B (Lehman [5]). Let M be a matroid and e€ E(M). The Shannon 
Switching Game with respect to e has the following classification. 
(i) White wins playing second if and only if there is a block of M not 
containing e but spanning e in M. 
*C.N.R.S. avec le soutien du PRC Math-Info. 
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(ii) The first player wins if and only if there are blocks containing e in both M 
and M*. 

(iii) Black wins playing second if and only if there is a block of M* not 
containing e but spanning e in M*. 


A survey of Lehman theory and short proofs of Theorem 1A and 1B are 
contained in [2]. 


2. A solution to the Misere Shannon Switching Game 
We use the following lemma. 


Lemma 2.1. Let M be a matroid and X be a block of M. Two players Black and 
White play alternatively by marking elements of M. Then White playing first resp. 
second can force Black to mark a basis of M(X). 


Proof. Since M(X) is a block, then so is (M(X))*. By Theorem 1A there is a 
strategy ¥ for White playing second (resp. first) for marking a base of (M(X))*. 

White playing second uses the following strategy: 

(1) If Black marks an element of E(M)— X, then White marks any element of 
E(M) — X if there remains any. Otherwise White considers a fictitious move on X 
played by Black and answers according to J. 

(2) Black marks an element of X which is not a fictitious move. Then White 
answers according to ¥. 

(3) Blacks plays a fictitious move (i.e. an element chosen by White as a 
fictitious move of Black in the previous course of the game). Then White choses 
any non marked element if there remains any as a new fictitious move of Black 
and answers according to ¥. If there is no such element the game is Over. 

Observe that up to the order of moves, on X White answered to all moves of 
Black according to ¥. Hence White has marked a base of (M(X))*. It follows 
that Black has marked a base of M(X). 

The case White playing first reduces to the case playing second by considering 
the game on M plus a loop played as first move by Black. O 


The following definition is given in [1]. 

Let E be a set and € € 2°. Two players, Black and White, mark alternately non 
marked elements of E. White wins if he marks all elements of a set in ©. 
Otherwise Black wins. 

This game is a positional game of type | with set of winning configurations €. 
We denote it by (E, ©). 


Proposition 2.2. Let G=(E, 6) be a positional game of type 1. Suppose that 
White has a winning Strategy playing second. Then White has a winning strategy 
playing first an arbitrary element ee E. 
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Proposition 2.2 is an easy refinement of the well-known fact that if White has a 
winning strategy playing second, then he has also a winning strategy playing first 
(see [1, 4]). 


Theorem 2.3. The Misére Shannon Switching Game on a matroid M with respect 
to an element e € E(M) has the following classification: 
(i) If there is a block of M not containing e but spanning e in M then this game 
is winning for Black. 
(ii) If there is a block of M and a block of M* containing e, and |E| is odd, 
then the first player wins. 
(iii) If there is a block of M and a block of M* containing e, and |E| is even, 
then the second player wins. 
(iv) If there is a block of M* not containing e but spanning e in M* then White 
wins. 


Proof. (i) Let X be a block of M spanning but not containing e. By Lemma 2.1, 
Black can force White to mark a base of M(X). Then White loses. 

(ii) Let X and X’ be blocks of M and M* containing e respectively. Note that 
|E| — |X| and |E| —|[X’| are both odd. 

(1) White playing first uses the following strategy: The first move is any 
element of E\X’. 

If Black marks an element of E—X’, then White marks any non marked 
element of E — X' (always possible for parity reason). 

If Black marks an element of X’, then White plays according to the strategy 
given by Proposition 2.2 in the game on (M*(X’))* with @ the set of bases of 
(M*(X’))* and e being his first move. Then when the game is over White has 
marked a base of (M*(X’))*/e, hence Black has marked a base of M*(X’) not 
containing e but spanning e in M™*. It follows that there is no white circuit broken 
at e in M. 

(2) Black playing first. 

As in (1), Black forces White to mark a base of X. Then White marks a white 
circuit broken at e. 

(iii) There is a block X of M and a block X’ of M* containing e and |E| is 
even. 

Note that |E| — |X| and |E| —|X’| are both even. 

(1) White playing first. 

Black uses the following strategy: If White marks an element of X, then Black 
plays according to the strategy given by Proposition 2.2 in the game on (M(X))* 
with @ the set of bases of (M(X))* and e being his first move. 

If White marks an element of E — X, then Black plays any element of E — X. 

Using this strategy, Black forces White to mark a base of X. Hence White 
marks a circuit broken at e. 

(2) Black playing first. 
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White uses the strategy in (1), where X is replaced by X' and M(X) by 
M*(X'). 

(iv) Let X' be a block of M* spanning but not containing e. by Lemma 2.1 
White can force Black to mark a base of M*(X). It follows that there is no white 
circuit broken at e when the game is over. O 
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1. Background 


Recently, some remarkable connections between commutative algebra and 
combinatorics have been discovered. Among the main topics in this area are the 
concepts of Cohen—Macaulay and Gorenstein complexes. 

Recall some basic definitions. Let V = {x,, x2,...,x,} be a finite set and Aa 
simplicial complex on V, i.e. A is a collection of subsets of V satisfying (1) 
{v} eA for all ue V and (2) ae A, tco imply te A. The dimension of A is 
(max,., #(c)) — 1, where #(o) is the cardinality of o as a set. If #(0) =i +1, 
then a is called an i-face. Also, let A=k{x,, x2,...,x,] be the polynomial ring 
in n-variables over a field k. We regard the elements x, as indeterminates over k. 
Let I, be the ideal of A generated by all square-free monomials not contained in 
A, namely 


Ty = (Xi, Xi, °° * Xi, | a, <ig<- + ci,, (Xi) Xi Xi} € A). 


Now define k[A] to be A/J,, the quotient ring of A mod I,. k{A] is called the 
Stanley—Reisner ring of the complex A. 

We call A a Cohen-—Macaulay (resp. Gorenstein) complex over k if the 
corresponding Stanley—Reisner ring k[A] is a Cohen—Macaulay (resp. Goren- 
stein) ring. The concept of Cohen—Macaulay or Gorenstein rings is very 
important in commutative algebra. Refer to Hochster [4] and Stanley [8] for the 
definition and elementary properties of Cohen—Macaulay and Gorenstein rings. 

Now, the following theorem of Reisner is very important in the theory of 
Cohen—Macaulay complexes. 
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Theorem (Reisner [6]). A complex A is Cohen—Macaulay over a field k if and 
only if for all a€ A (including 0 =), H,(link,(o), k)=0 for i< dim(link,(a)), 
where link,(o) = {te A; TNG =H, TUdE A}. 


In particular, if the geometric realization |A| of A is a triangulation of a sphere, 
then A is Cohen—Macaulay over an arbitrary field k. Stanley [7] used this fact to 
prove the so called upper bound conjecture for spheres. Stanley’s method is one 
of the most dramatic applications of commutative algebra to combinatorics. 

Also, a finite poset P is called Cohen—Macaulay (resp. Gorenstein) over k if the 
order complex A(P) is Cohen—Macaulay (resp. Gorenstein) over k. Here, A(P) 
is the set of chains (totally ordered sets) of P, which turns out to be a simplicial 
complex on the set of elements of P. We define the rank of a poset P to be the 
dimension of A(P). 

The theory of Cohen—Macaulay posets has been developed by many mathe- 
maticians, notably, Baclawski, Bj6rmer, Garsia and Stanley. 

On the other hand, DeConciai et al. (cf. [1]) introduced the concept of 
algebras with straightening laws. Let R = ® 20 R, be a commutative graded ring 
defined over a field k (= Ry) and P a finite poset contained in R,. Then a product 
@,a@,°:° a, of elements of P in R is called a monomial, and a monomial 
@,@--+ a, is called standard if a, <a,<---<a,. 

Then R is called an algebra with straightening laws (abbreviated ASL) on P 
over k if the following conditions are satisfied: 

(ASL-1) The set of standard monomials is a basis of R as a vector space over k. 
(ASL-2) If a and B in P are incomparable and if 


aB = > ry; (O#r Ek, y,; <6) 


is the unique expression for af in R, as a linear combination of distinct standard 
monomials guaranteed by (ASL-1), then y; < a, B for every i. 

A fundamental theorem on ASL’s states that if P is Cohen—Macaulay (resp. 
Gorenstein) over k then every ASL on P over k is Cohen-Macaulay (resp. 
Gorenstein). From this fundamental theorem, we can obtain much information 
about any ASL on P from the combinatorics of the poset P. 

Since we are interested in ASL’s which are integral domains, we give the 
following definition: 


Definition. A poset P is called integral over a field k if there exists'an ASL 
domain on P U {0} over k, where 6 is a unique minimal element of P U {6}. We 
use the convention that 6 is never an element of P. 

For example, the cycle of length 2n (Fig. 1) is integral if and only if n <4 (cf. 
(2}). 


Next, we consider Gorenstein properties. 
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Fig. t. 


Definition. A Cohen-Macaulay poset P over a field k is called weakly Gorenstein 
over k if there exists a Gorenstein ASL on P over k. 


We remark that if the geometric realization |A(P)| of the order complex A(P) 
of P is a triangulation of a sphere then P is Gorenstein, hence P is weakly 
Gorenstein. 

Our final goal is to solve the following: 


Problem. Characterize the weakly Gorenstein integral posets of rank d — 1. 


In the case of d= 1, the answer is easy, namely, the weakly Gorenstein posets 
of rank zero are the poset of Fig. 2(a) or the poset of Fig. 2(b). Also, in the case 
of d =2, we can find all weakly Gorenstein integral posets. 


Theorem (Hibi-Watanabe [2]). Let k be an infinite field. Then the weakly 
Gorenstein integral posets over k are those shown in Fig. 3. 


However, if d >2 then the situation is very obscure. Now, to attack the case of 
d=3, to begin with, we must find all integral posets P such that |A(P)| is a 
triangulation of the 2-sphere. To do this, we consider the combinatorial problem 
of finding a simple way to classify all posets P for which |A(P)| is a triangulation 
of the 2-sphere. 


Fig. 2. 
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V A DAN 
DCD 


MMW 


(ntarbitrary) (n<4) 


Fig. 3. 


2. Main result 
We now state our main result. 


Theorem. The following statements are equivalent for a poset P: 
(1) The geometric realization of the order complex A(P) is a triangulation of the 
2-sphere. 
(2) P is the face poset of a 2-connected plane graph. 


To explain this theorem, we consider the following poset P (Fig. 4). 

The 1-skeleton of A(P), written A(P)‘”, is regarded as a graph. And the 
geometric realization of A(P) is a triangulation of the 2-sphere (see Fig. 5). 
Therefore, for each 6; (1 <j <6), the number of 2-simplexes in |A(P)|, which 
contain B;, is four. A(P)‘” — {B; | 1<j <6} is regarded as a plane graph whose 
regions are quadrangles (see Fig. 6). 
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: acpyt?-4 a ,(1<j6)} 


Yo Y, 
Fig. 6. 


In the following, we shall construct a graph G(P). The construction is in two 
stages: 
(1) We choose {7,, | 1 <m <4} as the vertex set of G(P). 
(2) For each quadrangle, we draw an edge joining the two vertices of 
{Ym | 1<m <4} in the quadrangle. 


Y 
2 


YQ Y 
Fig. 7. 


4 
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(4) (5) (6) 

a (7) 
(8) (9) (10) (iy) - 
(12) (13) (14) ‘is 

(16) (17) \18) 
Fig. 8. 


The graph G(P) is a 2-connected plane graph and is regarded as a complex, 
where vertices, edges and regions of G(P) are faces of the complex (see Fig. 7). 
The faces of G(P) ordered by inclusion, form a poset. This poset is called the face 
poset of G(P). Since each vertex, each edge and each region of G(P) 
corresponds, respectively, to an element of {y,, |1<m<4}, an element of 
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{B; | 1<j <6} and an element of {a;|1<i<4}, the poset P is the face poset of 
G(P). 

The above process gives the reader the basic idea behind the proof of our 
theorem. Consult (3] for further details. 

Using our theorem, we can obtain a list of 2-connected plane graphs whose 
order complexes have at most 24 maximal faces, see (3]. Using this list K.-i. 
Watanbe proved the following. 


Theorem (Theorem 2.2 in (9]). Let k be a field and P a poset of rank two. Assume 
that the geometric realization |A(P)| of the order complex A(P) is a triangulation 
of the 2-sphere. Then P is integral over k if and only if G(P) is isomorphic to one 
of the following 18 graphs (see Fig. 8). 
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The maximum clique problem is one of the NP-complete problems. There are graphs for 
which a reduction technique exists that transforms the problem for these graphs into one for 
graphs with specific properties in polynomial time. The resulting graphs do not grow 
exponentially in order and number. Graphs that allow such a reduction technique are called 
soft. Hard graphs are those graphs for which none of the reduction techniques can be applied. 
A list of properties of hard graphs is determined. 


1, Introduction 


The maximum clique problem (MCP) is the problem to determine a largest 
clique in a graph G. It is one of the NP-complete problems, for which we refer to 
Garey and Johnson [1]. Graph theoretic terminology will be according to Harary 
(2]. The problem is equivalent to the independent set problem (ISP) in the 
complementary graph G, which is the problem to determine a largest independ- 
ent set. 

We recall that a function f(N), is defined on the set of natural numbers, is 
called O(g(N)) if |f (N)| <m |g(N)| for N = No, where m and No are non-negative 
constants. If a graph G has N points an algorithm has polynomial complexity, or 
is shortly called polynomial, if the number of steps in the calculation is O(N‘), 
where c is a positive constant. 


Definition 1. A graph G is called soft if either 
(i) there exists a polynomial algorithm to solve the MCP for G, or 
(ii) there exists a transformation of the graph, taking only polynomial time, 
into a polynomial number of graphs of polynomial order such that a 
specific property is removed. 
Part (ii) of this definition is the new feature brought forward in this note. 


All polynomials are understood to be in N, the number of points of the original 
graph. In most cases that we shall deal with these transformations involve 
reduction techniques in which the number of points, the order, or the number of 
lines, the size, of the graph that is to be investigated further is reduced. 
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The philosophy behind this concept of soft graph can be made clear by the 
following example. Suppose a graph G is non-connected. Clearly we shall 
consider the components separately in our search for the maximum clique. By 
doing this we investigate graphs that have the property that they are connected. It 
takes only polynomial time to determine the components. There are only O(N) 
components and their orders are O(N) as well. These components may turn out 
to be hard in the sense that no polynomial algorithm is known for them. 
However, the original graph is soft in the sense that it has a property, 
non-connectedness, that certainly is not the deeper reason for the overall 
difficulty of the problem. We may determine the properties that are not the real 
cause for the overall difficulty. Their negations then determine more and more 
aspects of the structure of the graphs for which the problem is really difficult, the 
hard graphs for the maximum clique problem. 

It is important to stress the fact that we are considering certain properties. As a 
result of some decomposition technique there may arise more graphs than one, 
each of which is to be investigated on maximum cliques. Such a decomposition 
not necessarily refers to a specific property. A typical example is the technique of 
Tarjan and Trojanowski [6]. 

Each point v determines a set of neighbours N(v) and we may decompose the 
graph into N graphs (vUN(v)). All cliques are contained in these induced 
subgraphs. It is clear that on repetition of this procedure one finally finds a 
maximum clique. Obviously, as v is adjacent to all other points of (vu UN(v)), v 
may be left out and one needs only determine a maximum clique in (N(v)). 
There is reduction of the order. However, each repetition introduces a factor 
O(N) for the number of graphs that are to be investigated. A combinatorial 
explosion occurs and Tarjan and Trojanowski find an O(2”*) complexity. We 
shall call this procedure direct decomposition. 

Reduction techniques like direct decomposition should be distinguished from a 
decomposition like splitting a graph into its components. That decomposition 
technique does not induce a combinatorial explosion for the simple reason that it 
cannot be repeated. 


2. Soft graphs 
The proofs of the following lemmas are quite simple and are available in a 
more elaborate version of this note. The proofs of Lemmas 2 and 10 are given as 


examples. 


Lemma 1, Non-connected graphs and graphs with non-connected complements 
are soft. 


Definition 2. Point v is said to dominate point w if N(v) > N(w). 
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Lemma 2. Graphs that contain a point that dominates some other point are soft. 


Proof. Finding a point that dominates some other point can be done by 
comparing N(v) and N(w) for all pairs of points v and w, i.e. in polynomial time. 
Let point v dominate point w, then any maximal clique that contains w has a 
counterpart clique in which w has been replaced by v. This clique may be 
maximal as well or may be part of an even large clique. For the search of a 
maximum clique point w may be deleted and the order of the graph is reduced. 
O(N) repetitions of the procedure remove all dominated points. O 


Lemma 3. Graphs with cutpoints are soft. 


This lemma gives an example of a reduction technique in which the order of the 
graphs is reduced, but the total number of points in the resulting graphs, the 
blocks, has increased. As the number of resulting graphs is O(N) the reduction 
technique does not induce a combinatorial explosion. 


Definition 3. Cutgraphs of order k of a graph G are induced subgraphs on k 
points with point sets that are cut sets for G. 


The obvious generalization of Lemma 3 is: 


Lemma 4. Graphs with cutgraphs of order smaller than or equal to a constant k 
are soft. 


A related result is that of Whitesides [7], that in our terminology reads: 
Lemma 5. Graphs with cutgraphs that are cliques are soft. 


If G is the line graph of a graph H, then cliques in G correspond to stars in H. 
Independent sets in G correspond to matchings in H. This was reason for the 
author to study decomposition methods based on turning graphs into line graphs 
(3]. As a result the following was found. 


Lemma 6. Graphs that, by addition of lines, can be turned into a line graph that is 
not a complete graph are soft. 


One of the forbidden induced subgraphs for a line graph is the graph K, 3. This 
graph turns out to be non-relevant for the MCP. In [3] the author used 
detachments to turn graphs without induced subgraphs K,— x into line graphs. 
However, if that line of thought is not followed there are even simpler proofs of 
the following lemma. One technique was given by Dahlhaus, another by 
Schrijver. 
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Lemma 7. Graphs that do not contain K,— x as induced subgraph are soft. 


The importance of graphs like K,— x is that they have a signalling function 
with respect to the fact that a line belongs to only one clique. Whenever a line is 
uni-cliqual this is found out by the fact that the line does not occur as the diagonal 
of a K,—x. Based on a remark of H.J. Veldman we have: 


Lemma 8. Graphs with uni-cliqual lines are soft. 


The reader should note the special case in which a line does not belong to any 
triangle and therefore forms itself a maximal clique. 

Triangles have a signalling function for uni-cliqual points. If all sets of three 
points that contain point v induce a K;, point v is unicliqual. 


Lemma 9. Graphs with uni-cliqual points are soft. 


A, perhaps superfluous, note for the Lemmas 8 and 9 is that the number of 
cliques that is memorized is polynomial. No combinatorial explosion is induced in 
the space-complexity. 


Another important induced subgraph on four ponts is K,U K,, that we shall 
call a point-triangle (configuration). The complement of this graph is K, 3. 


Lemma 10. Graphs without point-triangles as induced subgraphs are soft. 

Proof. If a graph G does not contain point-triangles then G does not contain a 
K,,3 as induced subgraph. A deep result of Minty [4] and Sbihi [5] shows that the 
ISP can then be solved in polynomial time in G, which solves the MCP for G. G 
is soft. O 


This result can be used for a further reduction technique. 


Lemma 11. Graphs with points that have neighbourhood graphs that do not 
contain point-triangles are soft. 


However, a fixed number of point-triangles can be admitted, as we have: 
Lemma 12. Graphs with the property that each point v has in its neighbourhood 
graph (N(v)) a number of graphs K,U K,, that is limited by a constant l, are soft. 
Discussion 


The main point of further research is the extension of the list of properties that 
make a graph soft. One may hope for extension of the list of properties in such a 
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way that one can prove that no graphs can be found that have none of them. This 
would establish the fact that at least one of the reduction techniques is bound to 
succeed at each phase of the reduction process. In a situation like that the 
complexity of the composite algorithm consisting of successive reductions may be 
investigated. Meanwhile a set of hard graphs remains. The order of the smallest 
hard graph known, having none of the properties mentioned in the lemmas, is 12. 
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1. Introduction 


Let n, k and A be positive integers such that there exists a symmetric 
(v, k, A)-design D =(P, B), where P and B denote the sets of points and blocks 
respectively. Further let us assume that there exists a bijection T from B to P 
satisfying the following two conditions: 

(i) For every block a, T(a) ¢ a. 

(ii) If T(@) € B, then T(B) ¢ a. 

Now we define a digraph D=(V, A), where V and A denote the sets of 
vertices and arcs respectively, as follows: V = P, and (a, b) is an arc if and only if 
be T™ (a). 

By properties (i) and (ii) of T, if (a, 6) is an arc then b #a and (6, a) is not an 
arc. Namely D is asymmetric. 

For a vertex a put N*(a)= {be V such that (a, b)¢ A} and N'(a)={beV 
such that (b, a) € A}. Then N*(a) = @ where T(@) =a, and N (a) = {T(a@) such 
that a € aw}. Since D is a symmetric design, we have that |N*(a)|=|N (a)| =k, 
and |N*(a) NN*(b)| =|N (a) NN (b)| =A, where a, be V with a#b. Namely 
D is regular of valency & and doubly regular of double valency A. 

For brevity we call D a DRAD with parameters vu, k and A. 

In this paper we show some basic properties of DRADs and consider the 
existence and non-existence of an above mentioned bijection T for given 
symmetric designs. 


Notation. For a finite set S, |S| denotes the number of elements of S. 


2. Some basic properties of DRADs 
Proposition 1. v = 2k + 1 and hence k=2A +1. 


Proof. If v<2k +1, then N*(a)NN (a)#@, aeV, This destroys the asym- 
metry of D. O 
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Remark. The case where the equality holds in Proposition 1 corresponds to the 
case of Hadamard tournaments. For this see [{1, 4 and 6]. 


Proposition 2. D is strongly connected. 


Proof. Let a be any vertex and C(a) the set of vertices to which there exist 
(directed) paths from a. By the definition of a DRAD it is easy to see that (i) 
there exists a vertex a such that C(a) = V and (ii) V={aeV;C(a)=V}. O 


Proposition 3. Let a be any vertex, N3(a) the set of vertices whose distance from a 
equals two and Nj (a) the set of vertices whose distance to a equals two. Then 
|N3(a)| and |Nz(a)| are not less than (n?/2A) + (4n) — (34/8), wheren=k—A. 


Proof. Let N*(a)=a={bj, bo,...,b,} and b;=T(B;), 1<i<k. Then we 
have that N3(a) =) <i<x 8; - a. Clearly it holds that |,<:<,B;-a|2=xn—- 
(3)A. Put F(x) =xn — (3)A. Then F(x) takes the largest value at x = (n/A) + (4). 
So we have that |NJ(a)|>(n/A)—(4))n — (2)((n/A) — G))((n/A) — @)A = 
(n?/2A) + (3) — (34/8). The proof for |Nz(a)| is similar. O 


Proposition 4. The girth g of a DRAD is equal to at most four. 


Proof. Take any vertex a. We show that N3(a)M(N (a) U N3z(a)) #9. In fact, 
by Proposition 3 we have that |{a}|+|N*(a)| + |NZ(a)| +|N7(a)| + |Nz(@)| - 
v21+2k + (n7/A) +n — (34/4) -—v =(n/A)+n4+(A/4)4+1>0. If Nif(a)N 
(N- (a) UNz(a)) =, then N*(a)N Nz(a) #9. If be N*(a)MNz(a), then there 
exists a vertex c such that (b,c) and (c,a) are arcs. This means that 
c é€Nz(a)NN (a), which is a contradiction. O 


Proposition 5. The diameter d of a DRAD equals at most four. 


Proof. Suppose that d=5. Then there exist six vertices a,,0<i<5, such that 
Q@y— @,— a, 43;— a4— as is a shortest path from ay to as. This implies that 
{ao}, N*(ao), N3(@o), Nz(as), N (as) and {as} are pairwise disjoint. However, 
as in the proof of Proposition 4 we have that 


I{ao}| + |N*(ao)| + NZ (ao)| + I{as}] + IN“ (as) + |N2 (@5)| > v. 


This is a contradiction. O 


Question. Does there exist a DRAD with g = 4 or d = 4? 


3. An existence theorem for DRADs 


Proposition 6. Let D=(P,B) be a cyclic projective plane given as follows: 
P=Z/(v), where Z denotes the ring of rational integers and v=|P| and 


Doubly regular asymmetric digraphs 183 


B={a+i,ieP}, where @ is any block (line) of D. Then we can define a 
bijection T from B to P satisfying (i) and (ii) in Section 1 and 
(iii) T(a + i) = T(a) +i, ie P. 


We call such a bijection T cyclic. 


Proof. We have that v=|P|=n?+n+1 and k=|a|=n+1. Hence we have 
that |a+a|<n+1+("3')<n? for n=4. So if n>=4 let a be any point outside 
a such that 2a is outside a + a. Then define T as follows: T(a +i) =a +i, ie P. 
Apparently T satisfies (i) and (iii). If there exist two distinct elements i and j of P 
such that T(a +i) ¢ a@+j and T(a+/j)€a@ +i, then we have that 2T(a@) € ata. 
This is a contradiction. If n=2 and a={0, 1,3}, then define T as follows: 
T(a+i)=6+i%, ie P. If n=3 and a= ({0, 1, 3,9}, then define T as follows: 
T(a+i)=4+i, teP. 


Proposition 7. Let D=(P, B) be a cyclic symmetric design given as in Proposi- 
tion 6. (We do not assume that }=1). Let a block a satisfy the following 
condition: 0# a and 0# a + a. Then we can define a cyclic bijection T as follows: 
T(a+i)=i, ie P. 


Proof. Obviously T satisfies (i) and (iii). If there exist two distinct elements i and 
j of P such that T(a+i)ea+j and T(a+j)e€a+i, then we have that 
0=(i-j)+(j—i) € a +a, which is against the assumption. 

In general, a symmetric design has more than one bijections T satisfying the 
assumptions in Section 1. 


Example 1. Let us consider a cyclic projective plane with n =3 and v = 13. Let 
a= {0,1, 3,9} be a line. Then define T as follows: T(a)=2, T(a+1)=5, 
T(a+2)=8, T(a+3)=7, T(a+4)=9, T(a+5)=12, T(a +6) =10, T(a + 
7)=1, T(a+8)=6, T(a+9)=11, T(a+10)=3, T(a+11)=0 and T(a + 
12) = 4. It is easy to check that T satisfies (i) and (ii) in Section 1. Obviously T is 
not cyclic. 


On the other hand, not every symmetric design has a bijection T satisfying the 
assumptions in Section 1. 


Example 2. Let D=(P, B) be a cyclic symmetric design with parameters 
v=15, k=7 and A=3 such that {0, 1,2, 4, 5, 8, 10} is a block. Then there exists 
no bijection T for D satisfying (i) and (ii) in Section 1, though it is slightly 
laborious to check this fact by hand. We notice that D is a Hadamard design and 
that the corresponding Hadamard matric of order 16 is of the group type. For this 
see [2]. 
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4. Remarks on adjacency and incidence matrices 


Proposition 8. Let D =(P, B) be a symmetric design, where P = {a,, a2,..., a,} 
and B={a,, @,..., @}, and let A be the incidence matrix of D. If A can be 
chosen so that A+ A' +1 is a (0, 1) matrix, where t denotes the transposition and I 
is the identity matrix, then D admits a bijection T defined in Section 1 and vice 
versa. 


Proof. Suppose that D admits T. Then we may choose A so that the (i, j)-entry 
of A equals 1 if and only if T(a;) belongs to a. Now the property (i) of T implies 
that the diagonal entries of A equal 0, and the property (ii) of T implies that if the 
(i, j)-entry of A equals 1 then the (i, /)-entry of A‘ equals 0 for i #j. It is easy to 
prove the converse, too. O 


Now it is well known that A is normal. See, for instance, [5]. However, we 
would like to remark that it is more than that, namely PAQ is normal for any 
permutation matrices P and Q. We call such a matrix 2-normal. 


Proposition 9. Let A be a real square matrix of order v. Then A is x-normal if 
and only if A is normal and AA‘ = al + bj, where a and b are real numbers, and J 
denotes the all one matrix. 


Proof. If A is normal and AA'=al + bJ, then PAQQ'A'P' = PAA'P' = P(al + 
bJ)P* = al + bJ = Q'(al + bJ)Q = Q'A'AQ = Q'A'P'PAQ. 

Let S=(s,;) be a permutation matrix corresponding to a permutation a, 
namely the (i, j)-entry of S equals 1 if and only if j=o(i), 1<i<v. If A is 
az-normal then S'A'AS = ASS'A'= AA‘= A'A. Put B= A'A=(b6;,). Then the 
(i, )-entry of S'BS equals Y)45;1b) 45k. = 00-\i),0 “ny. SO we have that b= 
ba-\),0-\) for any permutation 9a. 

Now we return to the situation of Proposition 8 and assume that A has the 
property that A + A'+ / is a (0, 1)-matrix. Then define T by T(a;) =a, 1<1<v. 
Then we get a DRAD D and A can be regarded as the adjacency matrix of D. 
Clearly k is a simple eigenvalue of A and the remaining v — 1 eigenvalues of A 
lies on the circle |x| = (k — A)}. 

Finally we would like to mention the following question which is well known in 
the case of Hadamard designs. 


Question. Suppose that a symmetric (v, k, A) design D, exists, where v > 
2k + 1. Then does there exist a symmetric design D, with the same parameters 
such that D, admits a bijection T satisfying the assumptions in Section 1? 
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Let M be a matroid on set E, |E| =m, with rank function r. For a positive integer w, M is said 
to be wth L-ind (C-ind) orderable if there exists an ordering O of E such that any consecutive 
(cyclically consecutive) w etements are independent. It is proved that M is wth L-ind orderabte 
if and only if [m/w] -(w—r(E-S)) <|S|< [m/w] -r(S) holds for any Sc E. White, we 
conjecture that M is wth C-ind orderabte if and only if |S| <r(S) - (m/w) hotds for any Sc E. 
This is verified for several classes. 


1. Introduction 


Let M be a matroid on set E with rank function r. We put |E|=m and 
r(E)=r. 
An ordering O of E is a bijection 


O:E— {1, 2,..., mb}. 
For a positive integer w, O is said to be a wth L-ind (linearly independent) 
ordering if it satisfies the condition: 
(L)A(i) = {O'(k) | kK =i,i+1,...,i+w-1} 
is independent for anyi, 1<i<m—-w+l. 


While if ordering O satisfies the following strengthened condition (C), O is said to 
be the wth C-ind (cyclically independent) ordering. 
(C) Bi) ={O(k) |K=ii4+1,..., i+w—1 (modm)} 
is independent for anyi, 1<i<m. 

If there exists a wth L-ind (C-ind) ordering, M is said to be wth L-ind (C-ind) 
orderable. Note that if M is ith L-ind (C-ind) orderable, then for any j <i M is jth 
L-ind (C-ind) orderable. Furthermore, the dual of an rth C-ind orderable matroid 
is (m —r)th C-ind orderable. 

The purpose of this paper is to characterize these matroids. For the wth L-ind 


orderable matroids, a complete characterization is obtained. The proof is a direct 
extension of the previous work [1], which considers the problem in case w =r. 
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However, for the wth C-ind orderable matroids, the problem looks very tough. 
We can only provide a necessary condition, called (w UNICOVER), though we 
conjecture that it is also sufficient. At present, we verify that the conjecture is 
true for the following cases. 

(1) Graphs consisting of two disjoint spanning trees and any w <r. 

(2) Matroids and w = 1 or 2. 

(3) Simple graphs and w = 3 or 4. 

(4) Complete graphs and any w <r. 

(5) 2-trees and any w <r. 
It must be noted that any matroid in (1), (4), and (5) satisfies (w UNICOVER) 
for any w <r. 


2. wth L-ind orderable matroids 


An independent set of cardinality w (w <r) is called for simplicity a w-set. Let 
p=|m/w], t= [m/w]. The following two lemmas should be called the packing 
and covering theorems [2, 3] applied to the truncation of M to w, respectively. 


Lemma 1. There exist p disjoint w-sets in M if and only if the following condition 
is satisfied: 

(w PACK) ForanyScE,|S|=p-(w-r(E—S)). 
Lemma 2. E is covered with t w-sets if and only if the following condition is 
satisfied : 

(w COVER) Forany ScE, |S|<t-r(S). 


The following lemma can also be proved by an analogous manner as for the 
above lemmas. 


Lemma 3. /f M satisfies both (w PACK) and (w COVER), M is partitioned into p 
disjoint w-sets and one (possibly empty) independent set. 


Now we can give a characterization theorem which provides a polynomial time 
algorithm to recognize the wth L-ind orderable matroids. 


Theorem 1. The following four conditions for M are equivalent: 
(1) Both (w PACK) and (w COVER) are satisfied, that is, for any Sc E, 
p-(w-r(E-S))<|S| <¢-r(S) 


(2) E is covered with t w-sets and contains p disjoint w-sets. 
(3) E is partitioned into p disjoint w-sets and one independent set. 
(4) M is wth L-ind orderable. 
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Proof. It suffices to prove (3)— (4). Let E=AUS,U---US, be the partition as 
in (3) where S; is a w-set and A, 0<|A|<vw, is an independent set. 

We first show a method of ordering A US, such that its consecutive w elements 
are independent. Then the idea is applied to extend the ordering of AUS,US,, 
AUS,US,US;,..., and AUS,U---US,. 

An element which is assigned the kth order is denoted by e,. First of all, we 
give A with arbitrary ordering. Let |A| = q (<w) and A= {e,..., eg}. S;U {e,} 
has at most one circuit and the circuit has non-null intersection with S,, since A 
and S, are both independent. Choose any element from 5S, which is contained in 
the circuit if one exists, and any element otherwise. And let it be e,,,. Then, 
Si =S,U {e,} — {€gaw} is a w-set. S}U {e,-,} has at most one circuit which has 
non-null intersection with S,-{e,,,} since S, and A-{e,,,,} are both independent. 
Choose any element from Si which is contained in the circuit if one exists, and 
any element otherwise. Let the element be e,,,-;. Then, S}=S}U {e,-1}- 
{€g4w-1} Is a w-set. 

Continue this procedure up to get the ordering 


(Openncig Cpawi Son 
q 


The rest, Sy-{@14.- ++» €g+w}, 18 ordered arbitrarily from q + 1 through w. Thus 
we obtain an ordering of AUS,. It is evident that every set of consecutive w 
element is a w-set. 

The above ordering was based only on the facts that A and S, are both 
independent and that |S,|=w. Furthermore, recall that in the procedure we are 
allowed to order A arbitrarily. Thus we see that the same principle applies to the 
ordering of S;U S;,, when S; is ordered and S;,, is not @=1,...,p—1). Thus, 
the ordering is extended to AUS,U---US,. O 


3. wth C-ind orderable matroids 


If M is wth C-ind orderable, there exist m w-sets which cover E such that each 
element is covered exactly w times. 

Consider a matroid M“ = (E™, r™) which is derived from M by replacing 
each element with w parallel elements. Note that r(S™), S cE™), is equal 
to r(S) where Sc E consists of the elements whose corresponding elements in 
M™ are in S). Then, if M is wth C-ind orderable, M“” is covered with m w-sets 
and also contains m disjoint w-sets. Therefore, by Lemma 1 and 2, 


E|/w | : (r?™(E™) = rE = s™)) < \s™| < JE |/w] : (7 (S™)). 
holds for any S™ c E™. Since |E| = wm, this leads to 
(m/w) -(r —r(E — S)) <|S|<(m/w)-r(S) for any Sc E. 


In contrast to the linear case, the above two inequalities are equivalent [4]. Thus 
we get a necessary condition of the wth C-ind orderable matroids which is able to 
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be checked in polynomial time [2]. We believe that this condition is also 
sufficient. Indeed, we have various affirmative examples for the conjecture, which 
we describe in primal form in the following. 


Conjecture. A matroid M is wth C-ind orderable if and only if 
(w UNICOVER) |S|<(m/w)-r(S) for any SCE 


is Satisfied. 


Note that M_ satisfies (r UNICOVER) if and only if its dual satisfies 
(r UNICOVER). 

In the following, we list several matroids satisfying the condition for which we 
have been able to give wth C-ind orderings. 


4. Examples of wth C-ind orderable matroids 


For a graph G=(V,E), G—S, ScV, is the graph obtained from G by 
deleting S. E(G) also denotes the set of edges of G. An edge connecting vertices 
v and u is denoted by (v, u). 

A graph consisting of two disjoint spanning trees is called a CTS-graph 
(complementary tree structure graph) and one of the trees is called a peripheral 
tree. It is easy to see that any CTS-graph satisfies (vw UNICOVER) for any 
wsr. 


Theorem 2 (1]. A CT7S-graph is wth C-ind orderable for any w <r. 


Proof. It suffices to prove the theorem for w =r. We shall apply induction on r. 
If r =1 then the theorem is obviously true. (See Theorem 3.) Suppose that r > 2. 
Note that G contains a vertex of degree 2 or 3. If there exists a vertex of degree 
2, say v, G-{v} is again a CTS-graph. By induction hypothesis, there is an 
(r—1)th C-ind ordering O' in G-{v}. Define an ordering O of edges of G as 
follows. 


O'(e) (e € E(G — {v}) and O'(e) <r—1) 

ale (e € E(G — {v}) and O'(e) =r) 
(e is one edge incident to v) 

s (e is the other edge incident to v). 


O(e)= 


It is easy to see that any cyclically consecutive r edges under O are independent. 

Suppose that there is a vertex of degree 3, say uv. Let x, y, and z be the vertices 
adjacent to v. Two of them may be identical. Assuming that (x, v) and (y, v) are 
contained in a peripheral tree, let G’ be the graph obtained from G by removing 
vertex u and adding an edge (x, y). G’ is also a CTS-graph. Hence there is an 
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(r—1)th C-ind ordering O’. Without loss of generality, we can assume 
O'((x, y))=1. Let P= {O'~'(2), O'-'(3),..., O' (r-1)}. Since PU {(x, y)} 
is a spanning tree of G’, exactly one of x and y, say x, is connected to z in P. 
Now define an ordering O of edges of G as follows. 

O'(e) (e € E(G') and2<O'(e)<r-—1) 

O'(e)+1 (e€ E(G’) and O'(e) =r) 


O(e)=41 (e =(v, a)) 
r (e =(u, c)) 
2r (e =(v, 5)). 


It is easy to see that O is an rth C-ind ordering. O 


Since it is trivial that matroid M satisfies (1 UNICOVER) if and only if M 
contains no loop, the following fact is evident. 


Theorem 3. A matroid M is 1st C-ind orderable if and only if M satisfies (1 
UNICOVER). 


Since it is easy to see that matroid M satisfies (2 UNICOVER) if and only if 
r2=2 and any ScE such that r(S)=1 contains at most 4m elements, it is not 
difficult to prove the following theorem. 


Theorem 4. A matroid M is 2nd C-ind orderable if and only if M satisfies (2 
UNICOVER). 


Lemma 4. A simple graph G satisfies (3-UNICOVER) if and only if r=3 and G 
is not the graph with m = 4 containing a triangle. 


Proof. The necessity is obvious. Suppose that G does not satisfy (3- 
UNICOVER). If r =3, there exists S such that 


|S| > (m/3) - r(S). 


Since m=|S|,22r(S)21. If r(S) is 1 or 2, then |S| is at most 1 or 3, 
respectively. By the inequality, m<2 or m<4, respectively. From r >3 and 
m2r+\|S|—r(S), only the case when the above inequality holds is m= 4, 
r(S) =2, and |S|=3. O 


Theorem 5. A simple graph G is 3rd C-ind orderable if and only if G satisfies (3 
UNICOVER). 


Proof. The theorem is obviously true when r = 3. Furthermore, it is not difficult 
to check that every simple graph is 3rd C-ind orderable if r = 4. With using this 
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fact as basis, we prove the theorem by induction on r. Suppose that r(G)=5, 
|\V|=n=6. Let 6 be the minimum degree of G. Then, 6 <4m since 6n <2m. 
Let u be a vertex of degree at most 4m. By the induction hypothesis, there exists 
a 3rd C-ind ordering in G-{v}. We can insert the edges which are incident to uv 
into this ordering so that no pair of these edges are within distance 3 to get a 3rd 
C-ind ordering of G. O 


The following lemma seems not so trivial but the proof is omitted here for the 
space. 


Lemma 5. A simple graph G satisfies (4 UNICOVER) if and only if r=4 and G 
does not contain K,; if m=5, K,,2 if m=6, or K, if m=7 as an induced 
subgraph. 


Theorem 6. A simple graph G =(V, E) is 4th C-ind orderable if and only if G 
satisfies (4 UNICOVER). 


Proof. It is so cumbersome to describe our proof in detail that we will sketch it. 
First, we prove that any connected simple graph with 7 vertices is 4th C-ind 
orderable, almost through the exhaustive way. Then, we can prove the fact that 
any connected simple graph which has more than 7 vertices is 4th C-ind 
orderable, completing the proof. This key fact is proved by induction on n as 
follows. Suppose that G has n (>7) vertices. From 6n <2m and n28, it is 
concluded that G contain a vertex of degree at most 4m, say v. By induction 
hypothesis, the edges of G-{v} can be so ordered that any cyclically consecutive 4 
edges are independent. We can insert the edges which are incident to v into this 
ordering to get an ordering of E so that no pair of these edges are within distance 
4. 0 


It is easy to see that a complete graph satisfies (r UNICOVER). Illustrative 
examples in Fig. 1 and 2 showing a way of rth C-ind ordering will be the proof of 
the following theorem. 


Theorem 7. A complete graph is wth C-ind orderable for any w <r. 

For a positive integer k, a k-tree is the graph defined recursively as follows: (1) 
K,, is a k-tree, (2) a graph obtained from a k-tree T, by adding a vertex v and 
edges that connect v and the vertices forming a K, in T, is a k-tree. 


Lemma 6. A k-tree satisfies (r-UNICOVER). 


The proof is omitted here for reasons of space. But for the case of 2-trees, the 
following theorem implies the lemma. 
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Fig. 1. An rth C-ind ordering of Kg. 


Theorem 8. A 2-tree is wth C-ind orderable for any w <r. 


Proof. We shall apply induction on r. If r=1, that is‘K2, then our theorem is 
true. Suppose that r > 2. By the definition of 2-tree, we can find a vertex v which 
was added at the last stage of construction. Let u and w be the vertices which are 
adjacent to v. G-{v} is a 2-tree of rank r — 1, and so is (r — 1)th C-ind orderable. 


Fig. 2. An rth C-ind ordering of K3. 
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Let O' be such an ordering and O'((u, w)) = 1. Define an ordering of edges of G 
as follows: 


1 (e=(u, w)) 
2 (e =(u, v)) 

O(e)=4 O'(e)+1 (ee E(G— {v}) and2<O'(e)<r-—1) 
r+1 (e =(w, v)) 


O'(e)+2 (e€ E(G — {v}) andr <O'(e) < 2r —3). 


Every set of cyclically consecutive r edges under O that contains just one of (u, v) 
and (w,u) is independent. The only exception is set S={(u,v}, 
O7'(3),..., O-'(r), (w, v)}. This is also independent because SU {(u, w)} — 
{(u, v), (w, v)} is an (r — 1)-set of G-{v}. O 
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1. Introduction 


A school timetable problem is to find an efficient algorithm for constructing a 
school timetable which attains various given requirements consistently. Although 
many theoretical models (Cole {4], Csima [5], Gotlieb [9], Ikeda {13], Neufeld 
{24], Salazar [25], Welsh {26]) and construction algorithms (Almond [1], Csima 
{6], Dempster {7], Ikeda [10], Kirchgassner {20], Lions {21], Yule [29]) of school 
timetables have been proposed, the problem is still an unsolved one in both 
practical and theoretical point of view. 

In theoretical sense, those models are not sufficient to represent various 
complicated requirements for school timetables. Even if almost all requirements 
could be represented in a model, the models were so complicated that it was very 
difficult to find an efficient and simple algorithm. There are some theoretical 
solutions of timetable problems under very limited requirements. But it is 
nonsense in a practical point: of view, because schools could not accept such 
timetables in the actual education. 

On the other hand, there are some computer programs that help to construct a 
school timetable. Alsmost all of these programs simulate traditional manual ways 
and cannot investigate the constructability of a timetable in advance. 

In this paper, we propose a new combinatorial problem including a school 
timetable problem and give some theoretical solutions. The problem seems very 
simply but general because it also includes a network flow problem, a graph 
coloring problem and some other combinatorial problems as a special case of it. 


2. School timetable problem 


A school timetable problem was formalized in various manners. But almost all 
formalizations are too simple to accept a timetable constructed with them for 
actual schools. From the practical point of view, an actual school timetable 
problem should be formalized with the following complexity. 
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Definition 1 (Specification of timetable problem). A school timetable problem is 
an 11-tuple (7, C, W,p, E, L,g,h,r,1, Ao) which satisfies the following 
conditions: 
(1) Tis a finite set which is called the set of all teachers, 
(2) C is a finite set which is called the set of all classes, 
(3) W is usually a set {MON, TUE, WED, THU, FRI} and sometimes 
includes “SAT”’. It is called the set of all names of days in a week. 
(4) p is a function W— {1, 2,...}, where p(w) means number of periods in 
day w, 
(5) E is a finite set which is called the set of all special rooms, 
(6) L is a subset of (27 — {}) x (2° — {}) x (2*) x (1, 2, ...} which is called 
the set of all different lessons, 
(7) g is a function from T X W to {1,2,...}, where g(t, w)=m means that 
teacher ¢ can give upto m lessons on day w, 
(8) his a function from E to {1,2,...}, which defines capacity. of each special 
room, 
(9) r is a function from L to {1,2,...}, where r(/) =m means that a lesson / 
should be given m times in a week, 
(10) 1 is a subset of TX CXEXW-x {1,2,...}, which gives the set of all 
combinations to be inhibited, 
(11) Ag is a subset L x W x {1, 2,...}, where (J, w, p) € Ag means that lesson 
[should be given from period p on day w. 


In a high school (in Japan), each class has each class room. Set E is the set of 
all special rooms, e.g. music rooms, labs, gymnasiums other than ordinary class 
rooms. An ordinary lesson is specified by a teacher, a class and an ordinary class 
room Or sometimes a special room. 

A lesson which has two or more teachers and/or classes is called a set of 
parallel lessons, or simply a para-lesson. An example of a para-lesson / is a lesson 
of art that consists of three lessons, that is, music /, by teacher ¢, using music 
room eé;, painting /, by teacher f using painting room e2, and writing J, by 
teacher ¢, using writing room e3. Each student of two classes c, and c, takes one 
of these three lessons. In order that lesson / makes no effect on timetable 
construction based on assignment of each class-teacher combination to a period, 
three lessons (¢,, c1, €1), (t2, C2, €2) and (¢3, c,, €,) must be assinged at the same 
period where c, is a dummy class introduced imaginary. In such a case, a lesson 
includes two or more teachers, classes and special rooms as a result. 

An ordinary lesson occupies one period. There may be a special lesson which 
occupies two or more consecutive periods. Such a lesson is called a long-lesson. 
Number of periods occupied by a long-lesson is specified by the last component of 
l. If the last component is 1, it means that the lesson occupies one period. 

There may exist requirements that cannot be expressed within the above 
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framework. For example, a period before a lession of physics must be opened for 
a certain teacher, because the period is sometimes used for preparation of 
experiment. Another example is a lesson of cooking which should be assigned to 
4th or 5th period (immediately before or after lunch time) because students 
would have dishes made by themselves. These requirements are, however, 
relatively minor and there are many other cases when these can be avoided by 
some pre-assignments in Ag or pre-inhibitions in /. But in a practical school 
timetable construction, it is at least necessary to formalize as Definition 1. 

In terms of timetable problem defined above, a solution, called a timetable, can 
be defined as follows: 


Definition 2 (Solution of timetable problem). A solution of a timetable problem 
(T, C, W, p, E, L, g, h, 7, I, Ao) is a subset of A of Lx Wx {1,2,...}, 
satisfying the following conditions: 

(1) If (4, ¢, e, n, w, i) isin A theni+n—1<p(w), 

(2) If two elements of A in the same day w, say (t,c,e,n,w,i) and 
(t’, c’, e’, n’, w, i’) which include common teachers, i.e. 1 t' #9, then 
two period sets {i,i+1,...,i+n—1} and {i',i'+1,...,i'+n'—1) are 
disjoint, 

(3) If two elements of A in the same day w, say (t,¢,e,n,w,i) and 
(t', c’, e’, n', w, i’) which include common classes, i.e. cNc’ #9, then two 
period sets {i,i+1,...,i+n-—1} and {i’,i'+1,...,i'+n'—1)} are 
disjoint, 

(4) For each teacher ¢ and each day w, number of periods occupied by lessons 
that teacher ¢ gives in day w is less than or equal to g(t, w), that is, 

n<g(t,w), 
(¢,c,e,n,w.ijeA and tet 

(5) For each special room e and each period j of day w, number of lessons 

which use room e at period j is less than or equal to hA(e), that is, 


l{t,c,e,n,w, i)€e A:eeeand je {i,it+1,...,i+n—-1}|<h(e), 


(6) For each lesson / = (t, c, e, n) in L, number of elements in A which support 
it is equal to r(f), that is, 


[{(t, c, e, n, w, i) € A}|=r(t, ¢, e, n), 


(7) For each element (¢, c, e, w, i) in J, there is no element (f, ¢, e, n, w, i) of 
A such that re€ t, cee ande€e, 
(8) Set Ag is a subset of A. 


If a timetable problem specified in Definition 1 would be solved completely 
within a limited time, the timetable could be accepted in actual schools. The 
formalization is, however, too complex to find simple and efficient algorithms. 
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3. An existential problem of an n-dimensional weighted marginal-bounded 0-1 
matrix and Gotlieb’s model of a school timetable problem 


In order to formalize a timetable problem, Gotlieb {9] proposed a com- 
binatorial problem as follows: 


Definition 3 (Specification of nWMP). An existential problem of an n- 
dimensional weighted marginal-bounded 0-1 matrix is a 3-tuple (M, a, b) 
satisfying the following conditions: 

(1) M is a function from No to N, 


(2) a is an n-dimensional vector (a,, a2,...,4,) Of function a; from N, to N 
fori=1,2,...,A, 

(3) b is an n-dimensional vector (b,, b2,...,6,) of function b; from N; to N 
fori=1,2,...,A, 


where N = {0, 1,...}, 
No={1, 2,...,m,} X {1,2,...,mg}X-+->X{1,2,..., ma}, 
N,= {1, 2,..., mi} X {1, 2,..., ma} X-++-X {4, 2,..., m1} 
x {1,2,..., mar} x-->X (1, 2,..., m,}- 


The existential problem of an n-dimensional weighted marginal-bounded 0-1 
matrix is also referred to nWMP in short. 


Definition 4 (A solution of nWMP). A solution A of an nWMP (M, a, b) is a 
function from M to {0,1} which satisfies the following conditions: 


a(x)< > Ay): M(y)<6(x) 


veN((i)=x) 

for any x in N, and any jin {1,2,...,n}, 

where N{(i) =x} = (Qa, ..-  Xj-1, %, Xian, +--+ Xn) E No: 
(41, - 6+) Mina, Xian, Xn) EN}. 


Function M is called a weight function and vectors a and 6 are called a lower 
bound and an upper bound respectively. A special case when M(x) =1 or O for 
any x in No is sometimes referred to as nMP. 

We shall give an example of nWMP in a case that n = 2, m,=8 and m,=5. A 
weight function M and bounds a and 6 are given in Fig. 1, where (i, j)th element 
is m, means M(i, j)=m and ith element of the column (or row) vector named 
“a” and ‘“‘b” indicates a,(i) (or (a2(i)) and 6,(é) (or b2(i)) respectively. A solution 
A of the problem is also given in Fig. 1. 

In a case that n=2, nWMP is solvable by embedding it into a network flow 
problem. But if 2 >2, nWMP and even nMP and now unsolved problems. 
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M 123.4 5 ab A 123 45 
1 02000 02 1 01000 
2 10201 2 2 00001 
3 11000 11 3 10000 
4 30010 01 4 00010 
5 15001 1 5 5 01000 
6 1001 2 1 3 6 00010 
7 10000 01 7 10000 
8 013 1 4 1 3 8 00010 
a 27021 

b 3823 4 

Fig. 1. An example of nWMP and its solution. 


Gotlieb [9] pointed out the relationship between 3MP and a timetable problem as 
follows: 
Let (T, C, W, p, E, L, g,h, r, 1, Ao) be a timetable problem that satisfies the 
following conditions: 
(1) Each lesson is an ordinary one, i.e., 
{Le (t,¢, e,n):|t|=1, |e|=1, |e] =0,n =1}, 
(2) There is no constraint on number of lessons in each day w for each teacher 
t, i.e. 
g(t, w)=p(w), 
(3) There is no constraint on number of lessons using each room e at each 
period, i.e. 
h(e)=|Cl, 
(4) There is no pre-assignment, i.e. 
Ao i, qd. 


Such a timetable problem is assumed to be simple in this paper. 


Theorem 1. A simple timetable problem can be embedded into 3MP. 


Proof. Assume that sets T, C and {(w, i): we W, ie {1,2,..., p(w)}} (say, P) 
are integer sets, {1,2,...,mr}, {1,2,...,mc} and {1,2,...,mp} 
respectively. A function M from T x C x P to the set of all integers is defined as, 
0 if(,c, Del, 


MNGi) { otherwise. 


Lower bound a=(a;, a2, a3) and upper bound b=(b,, 62, 63) are given as 
follows: 

a,(c, 7) =0, a(t, c) are r(t, c), a3(t, }) =0, 

b(cf=1, blt,c)=r(tc), bs, y= 1. 
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Then the 3-tuple (M,a,b) is a 3MP. If there is a solution A of the 3MP 
(M, a, b), it is easy to check that A satisfies all conditions in Definition 2, that is, 
atimetable. O 


The timetable problem discussed above is a special case of that in Definition 1, 
because there are no considerations of para-lessons, long-lessons, special rooms, 
upper bound function g and pre-assignments in Ag. Even if a timetable problem is 
restricted in such a way, it is unsolvable at present. 

If we would intend to treat a general case of timetable problem within a 
framework of nWMP, the formalization of the problem would be so complicated 
that we could not find a simple and efficient algorithm. Therefore, we need a 
simple formalization which every requirement for timetabling can be represented 
uniformly. In order to do this, we propose a combinatorial problem which is 
referred to as an existential problem of a weight-controlled subset or WSP in short. 
We shall discuss the problem WSP and relationship between WSP and timetable 
problems in the followings. 


4. Existential problem of a weight-controlled subset and some results 


Definition 5 (Specification of WSP). An existential problem of a_ weight- 
controlled subset is specified by a S-tuple (U, S, w, a, b) satisfying the following 
conditions: 
(1) U is a finite set which is called the universe, 
(2) S is a collection of subsets of U, i.e. § <2”, which is called a condition set, 
(3) w is a function from M(U, S)= {(u, s):ue U, s eS, ues} to the set of all 
positive integers, which is called a weight function, 
(4) a is a function from § to non-negative integers, which is called a lower 
bound function, 
(5) b is a function from S$ to non-negative integers, which is called an upper 
bound function. 


Definition 6 (Solution of WSP). A solution of WSP (U, S, w, a, b) is a subset A 
of U satisfying the following condition: 
a(s)< > w(u,s)<b(s) for anysinS. 
ueAns 

Fig. 2 gives an example of WSP in the case that S=({s,,s,,s;}, U= 
{U,,Uz,..., Ug}, a(S) =6, a(s2) =3, a(s,;)=6, b(s,)=7, b(s2)=7 and b(s,) = 
8. Each element of S is indicated by the set of all non-zero elements in the 
corresponding column of the matrix, e.g. 5; = {u,, U2, U3, 4, Uo}. The matrix 
also indicates a weight function a, that is, (u;, s;) = (i, j)-element of the matrix. A 
solution A is given as last column of the matrix. 

A WSP has never been solved completely. If the number of elements in set U 
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@ Ss s Sy S53 A 
U 

uy 2 0 0 0 
uy 1 2 0 0 
uy 3 2 0 1 
uy 1 1 4 1 
Us 0 3 0 0 
Us 0 1 2 1 
us 0 0 2 0 
Us 0 0 1 0 
Uy 2 0 1 1 
a 6 3 6 

b 7 7 8 


Fig. 2. Example of WSP. 


or § is small, we may find solutions exhaustively by a computer. Unfortunately, U 
and S in almost all of practical cases, however, have relatively many elements. If 
we can reduce the size of U and/or § without any change of solvability of the 
problem, we may use the exhaustive algorithm. The following definitions and 
some results enable us to use the approach. 


Definition 7 (Complete subproblem of WSP). WSP (U, S, w, 4, b) is called a 
complete subproblem of a WSP (U', S', w', a’, b’) iff the following conditions 
are satisfied: 

(1) U is a subset of U’, 

(2) S is a subset of S’, 

(3) w’ is a restriction of w on M(U’, S’), i.e. w' = @lyu-s-, 

(4) If a WSP (U, S, w, a, b) has a solution then WSP (U', S', w', a’, b’) also 

has a solution. 


If a WSP P is a complete subproblem of a WSP P’, then P’ is called an 
expansion of P. 


Definition 8 (Trivial subset). Let (U, S, w, a, b) be a WSP and s be an element 
of S. If a(s) <0 then it is said that s has no lower bound. If b(s)> Yi., w(u, S) 
then it is said that s has no upper bound. If a(s) <0 and b(s)>¥,., w(u, s) then 
s is called a trivial subset. 


Proposition 1 (Reduction of trivial subset). Let (U,§, w, a,b) be a WSP and s 
be a trivial subset. A WSP (U, S', w', a', b') that is derived by the removal of s 
from the WSP (U,S, w,a,b) is a complete subproblem, where S' =S— {s}, 
w' =a) mcu.sy, 4 = als and b' = bly. 


Since a proof of Proposition 1 is trivial, we shall show only an example that 
includes a subset s, without lower bound, a subset s, without upper bound and a 
trivial subset s3 in Fig. 3. 
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@ Ss, Sz $3 
U 

uy, l l 1 
uy 2 l 3 
U3 l 0 0 
uy 0 2 l 
a 0 2 0 
b 2 5 5 


Fig. 3. Example of WSP including a trivial subset. 


Proposition 2 (Expansion by intersection). Let (U,§, w, a,b) be a WSP and 
S,,...,8% be k elements of S. A WSP (U,S',w',a',b’) that is derived by 
intersection of subsets is an expansion of the WSP (U, S, w, a, b) where 


S'’=SU {s’}, 
_ fou, s)ifseSandues, 
w'(u, 2) ole s):ueés,i=1,2,...,k}ifs=s' andues'’, a 
a(s)ifseS, 
a'(s)= max({a(s;))— >, w(u,s,):i=1,...,k}U {0}) ifs=s’, ee) 
ue(s,—s‘) 
b(s) ifs eS, 
hGH (4.3) 


min({6(s,) + Dd (max (ou, 5:7 =1, 2,...,} — ou, s)) 


ues’ 


c=1; Be edk tO} w(u, s)}) ifs=s', 


ues’ 


where s' =(\_, §;. 


Proof. The following equality is a direct sum decomposition of set s;. 
s,=(s;—s')®s’ (4.4) 
Let A be a solution of WSP (U, S, w, a, b). From (4.4), we have 
a(s)< > w(u,s;) 


ueAns, 
= Ss w(u, s;) + Sy w(u, §;). (4.5) 
ue AM(s,—s‘) ueANs’ 


On the other hand, for any i= 1, 2,...,k, 
a(s;) — S w(u, s;) 


ue(s,—s’) 


<a(s)— > (us) 


ueAN(s;—s’) 
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< D> (us) _ by (4.5) 


ueAns’ 


< > max{(u,s)):i=1,2,...,k} 


ueANns' 


= > o'(u,s’) by (4.1). 


ueAns’ 


Therefore, we have 


a'(s')= max( {a(s,) - » ow(u, s;):i=1,2,..., k} U (0}) 


ue(s;—s') 


< ¥ o'(u,s’). 
ueAns' 
On the function b’, we can also get the condition in the following: From (4.4) 
and the definition of WSP, we have 


+ of s)= dS oys)- DY ous) 


ueAns' ueAns, ueAN(s;—s') 
< > w(u, S;) 
ueANs; 
<= b(s;). (4.6) 
By adding Yueans: (max{w(u, s;):j =1,2,...,k}—w(u,s,)) to both most left 
and right sides of inequality (4.6), 
b(s)+ >, (max{o(u, s):7=1,2,...,k} — w(u, s;)) 


ueAns’ 


> wu, s;)+ Ss (max{o(u, s)):j=1,2,...,k} — w(u, s;)) 


ueANns' 


S (max{o(u, s)):f=1,2,...,k}) 


ueANns' 


>» '(u, s’) 


ueANns’ 


WV 


for anyi=1,2,...,k. 
Hence we get the inequality that a function b’ should be satisfied. O 


As an application of Proposition 2, we can give the following corollaries. 


Corollary 2.1. [f the set S includes an element s which can be represented by a 
combination of some other elements in § with intersections and has less strict 
bounds than that generated by the combination, removal of s from S makes no 
effect on solvability of the problem. 


Corollary 2.2. A WSP (U, S, w, a, b) has a solution A iff the closure of the WSP 
has a solution A, where the closure of a WSP (U,S,w,a,b) is a WSP 
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(U, S', w', a’, b’) as follows: 
(1) S’ is the closure set of S by intersection, i.e. S' satisfies the following 
conditions: 
(1) S is a subset of S’, 
(2) If sand s' are in S' then an intersection of s and s’ is also in S', 
(2) w', a' and b' are expansions of w, a and b respectively in the way of 
Proposition 2. 


Definition 9 (Inconsistency). A WSP (U, S, w, a, b) is said to be inconsistent if 
there is an element s in S’ that a’(s)>b'(s), where (U, S’, w', a’, b’) is the 
closure of (U, S, w, a, b). 


We shall give an example of inconsistent WSP in Fig. 4. By column-wise 
investigations, the problem seems to have a solution, but we can recognize that it 
has no solution by making a new column s,s. 


@ AY Ss, Sy 5, Ms, 
U 

uy l 0 0 
uz 3 2 3 
U3 2 2 2 
Uy l l l 

Us 0 2 0 

a 6 l 5 

b 6 3 4 


Fig. 4. Example of an inconsistent WSP. 


Corollary 2.3. If a WSP (U, S, w, a, b) is inconsistent then it has no solution. 


The authors conjecture that the converse of Corollary 2.3 is also true, i.e. 
If a WSP has no solution then it is inconsistent, 


but we have never solved it. 
We have discussed about reduction of columns above and now we shall show 
how to reduce rows of the matrix M(U, S). 


Definition 10 (Deterministic element). Let u be an element in the universe set U 
of a WSP (U,S,@,a,6). u is said to be a deterministic element when the 
following condition is satisfied: 


There are no solutions A and A’ such that u is in A and u is not in A’. 


Definition 11 (Dependent element). Let u and u’ be elements in the universe set 
U of a WSP (U, S, , a, b). wis said to be positively (negatively) dependent on u' 
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@ Ss, 
U 

uy 2 
uy 1 
uy 1 
uy 1 
a 1 
b 3 


Fig. 5. Example of deterministic elements and dependent elements. 


when the following condition is satisfied: 


u is in A iff u’ is (not) in A for any solution A of (U, S, w, a, 6). 


—- NRO 


1 


~ ONr Oo 


2 


coco .,--_ 


oro oO 
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In Fig. 5, u, is positively dependent on u2 and negatively dependent on u3. u, is 
a deterministic element. {A,, A>} is the set of all possible solutions of the WSP. 

Deterministic elements and/or positively dependent elements can be eliminated 
without effect on the existence of a solution of a WSP. The following two 


propositions give how to do it. 


Proposition 3 (Reduction of deterministic element). Let (U,S, w, a,b) be a 
WSP including a deterministic element u. If a WSP (U', S', w', a’, b') derived by 


the removal of u is a complete subproblem of the WSP, where 


U'=U—{u}, 
S' ={s— {u}:s€S}, 


Ty soos 
o = w|ucu's’)> 


Jiey= se ifs'=s, 
b(s) ifs'=s, 

b'(s')= 

es lo otherwise. 


Proof. If A’ is a solution of (U’, §’, w’, a’, b’), we have 


a'(s')< > w'(u',s')<b'(s’) for anys’ €S’. 


u'eA'Ns' 
Let A be a set as follows: 


_ fA’ if v(u) =0, 
vane if v(u) =1. 


max(a(s) — u(u) - w(u, s), 0) ifs’ =s — {u}, 


min(b(s) —u(u): w(u, s), my w(u', s’)] ifs'=s— {u}, 


1 if there exist a solution A such that A 3u, 


(4.7) 
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If v(u) =0 or s’ =s, then inequality (4.7) derives the inequality as follows: 


a(s)< > w(u',s)<b(s) for anyseS’. (4.8) 


u'eANs 


If v(u) =1 and s’ =s — {u}, then inequality (4.7) can be reformed as follows: 
a(s) — w(u, s) <= S w(u', s) <= b(s) — w(u, s). (4.9) 
u‘e(A—{u})N(s—{u}) 
By adding w(u, s) to each term of inequality (4.9), we have 
a(s)< 2 w(u',s)<b(s). (4.10) 


Then A is a solution of the WSP (U, S, w, a,b). O 


Proposition 4 (Reduction of positively dependent element). Let (U, S, w, a, b) 
be a WSP and u and u' be elements of U. If u is positively dependent on u' and a 
WSP (U',S', w', a’, b') derived by the removal of u' is a complete subproblem 
of the WSP, where 


U' =U — {u'}, 
S' = {s— {u'}:s eS}, 

yon nv fou", s) ifu"#u, 
KU i 6 SE ene ifu"=u, 
a'(s’) =a(s), 
b'(s') = b(s). 


Proof. If A’ is a solution of (U’, S’, w’, a’, b'), we have 
a'(s')< > w'(u",s')foranys’€S. (4.11) 


u"eA'Ns® 


Let A be a set as follows: 


aa if A’ 3u, 

A’ otherwise. 

If A’ 3 u then, inequality (4.11) can be reformed step by step as follows: 
a(s)< yy w'(u", s)<b(s), 

“u"e(A—{u'})N(s— {u’}) 


a(s)< Ss w(u",s)+ w(u', s)— w(u', s)<b(s), 


uve ANs 


a(s)= > w(u",s)<d(s). 


u"eANs 
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@ S 3s, So 53 A, Az 
U 

u, 3 1 1 1 0 
U3 1 1 2 0 1 
a 1 1 1 

b 3 1 2 


Fig. 6. Result by application of Propositions 3 and 4 to a WSP in Fig. 5. 


If A’ 3 u then, from inequality (4.11) we have 
a(s)= wu", s)<d(s). 
u’eANs 


Then A is a solution of the WSP (U, S,@,a,b). O 
The reduced WSP corresponding to the WSP given in Fig. S is given in Fig. 6. 


5. Relationship among a timetable problem and a WSP 


In this section, we shall show that any timetable problem can be embedded into 
a WSP. It means that a WSP is more general than a timetable problem. Although 
both problems have never been solved, a WSP is simpler than a timetable 


problem. 

In order to show that a timetable problem (T, C, W, p, E, L, g, h, r, I, Ao) can 
be embedded into a WSP, we shall make a WSP (U,S, , a,b) from the 
timetable problem in Lemma 1 and show that a timetable can be constructed 
from a solution of the WSP in Lemma 2. 


Lemma 1. A WSP can be constructed from a timetable problem. 


Proof. Let (T, C, W, p, E, L, g,h, r, I, Ao) be a timetable problem. Sets U and 
S, and functions w, a and b are defined by the following formulas: 


U=LxWx{i,2,...} 


— {(t, c, e, n, w, i):(t, c, e, w, i) el, tet, cec,e€e} 


— {(t, ce, n, w, i):n+i-—1>p(w)}, (5.1) 

S=§,US,US,US,US,US, (5.2) 
( = ifie {1, 2, 4, 5, 6} andues,, 
ST ifi=3anduess, 
a b 

s(t, w, J) 0 1 
s,(c,w,j) 0 1 
53(t, w) 0 g(t, w) 
s,e, w, J) 0 h(e) 
s5(J) 


r 
Se 1 
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where 
S, = {s,(t, w, j):te€T, we W, je {1,2,..., p(w)}}, 


S, = {s.(c, w, j):c eC, we W, je {1,2,..., p(w)}}, 

S; = {s3(t, w):t€T, we W}, 

S,= {s,(e, w, J):e €E, we W, je {1,2,..., p(w)}}, 

S; = {s5(/):le L}, 

Ss = {56(t, c, e, n, w, 1): (f, ¢, e, n, w, i) € Ao}, 

s(t, w,j)={(t,c,e,n,w,i)eU:tet, je {i,it1,...,itn—-1}}, 
s.(c, w, j)={(t,c,e,n, w,i)eU:cec, je {i,it+1,...,i+n—1}}, 
53(t, w) = {(t, c,e,n, w, i)E Ute t}, 

s,(e, w, j)={(t, ce, n, w, i)e U:eee, jef{i,it1,...,itn—-1}}, 
s,(1) = {(t, ce, n, w, 1)e U:l=(t,c,e,n), weW, ie {1,2,..., p(w)}}, 
S6(t, c, e, n, w, i} = {(f, c, 6, n, w, i)}. 


It is obvious that the 5-typle (U, S, w, a,b) isa WSP. O 


Lemma 2. [f there is a solution of a WSP derived from a timetable problem in the 
way of Lemma 1, a solution of the timetable problem can be constructed. 


Proof. Let (U, S, w, a, b) be a WSP which is derived from a timetable problem 
(T, C, W, p, E, L, g, h, r, I, Ao) by Lemma 1 and A be a solution of the WSP. 
We shail show that the set A is also a solution of the timetable problem. In order 
to do this, we shall check that A satisfies the condition (1), (2),...,(8) in 
Definition 2. 

Condition (1) and (7) is obvious from the definition of A. 

Condition (2): If two difference elements (t,c,e,1,w,i) and 
(t’, c',e’, n', w,i') of A satisfy the following two conditions simultaneously: 


mest, 

{i,i+1,...,itan-1} Ni, 4+1,...,i+n'-1} 3), 
then set s,(¢, w, j) includes the two elements in A. Then we have 

|AN s(t, w, j)| = 2. (5.3) 
On the other hand, we have the following inequality: 


JANs(w l= Dd wu, 5) <b(s,(2, », /) =1, 
ueANsy(t. w.j) 
because A is a solution of the WSP and w(u, s,)=1 when uw is in s,. This is 
contradiction to (5.3) and then the set A satisfies the Condition (2). 
Condition (3) can be derived in the same way as Condition (2). 
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Condition (4) can be derived immediately from the inequality of a solution A 
and the definition of weight function w by checking the following equalities: 


AN\s3(t, w) = {(t, 6, e,n, w,i)EA:tet}, 
w(u, s3)=nifu ess. 


Conditions (5) and (6) can also be derived immediately from the inequality of a 
solution A, the definitions of subsets s, and s, and the definition of weight 
function w in the same way as Condition (3). 

Condition (8): From the inequality of a solution A, we have 

1=a(s)< >) w(u,s)<b(s)=1 for anys in S. 
uecAnNss 


On the other hand, w(u, s.) = 1 if u is in s, and |s.| = 1. Therefore we get A >5., 
thatis AD Ay. O 


By Lemma 1 and Lemma 2, we can get the main theorem. 


Theorem 2. A timetable problem can be embedded into a WSP. 


6. Conclusion and discussion 


A school timetable problem is a very important and interesting problem. The 
quality of a timetable has a great impact upon the effects of education for 
pupils/students and labor condition of teachers. From the educational point of 
view, a timetable is one of the most important environments of education as well 
as quality and quantity of teachers, equipments, curriculums and so on. In spite 
of the importance, it has not been used long enough to keep the quality of 
timetable good, because timetable construction must be done in a very tight 
schedule. All requirements for a timetable are usually fixed in one or two weeks 
before the beginning of a new term. This means that there is not enough time for 
timetable construction manually. The larger school has more difficulty since the 
time needed for constructing a timetable depends on the size of school. 

In order to overcome the difficulties, there have been many trials to construct a 
timetable by computers. They derived the theory of school timetables which 
intends to establish a simple and powerful model of timetable and find efficient 
algorithms for constructing timetables and checking the constrictivity. Among 
models and algorithms proposed in advance, some of them restrict conditions too 
strong to accept the timetable for actual schools, and others can support only 
trial-and-error manual operations by a computer. 

In this paper, we proposed a new theoretical model for school timetable 
construction, which can represent various complex requirements which frequently 
appeared in the practical timetable construction. These have been treated as 
exceptional conditions in the models developed in advance. In spite of powerful 
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representability of our new model, it seems very simple. Unfortunately, it has 
never been solved completely. This paper gives some theoretical results that are 
useful for establishing the efficient algorithms. 

The combinatorial problem derived from a timetable problem is not only 
important in the practical sense, but also interesting in the theoretical sense. The 
problem is a generalized one of various classical problems in graph theory, e.g. 
coloring problem, Latin square problem, network flow problem and resource 
dispatching problem. Full-scale theoretical attack to the problem is our future 
plan. 
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A k-hypergraph G consists of a vertex-set V(G) and an edge-set E(G), a set of k-subsets of 
V(G). If X cV(G), the edges of the induced subgraph G[X] are those edges of G whose 
vertices are all contained in X. If ve V(G), then G—v denotes the induced subgraph 
G[V(G) — v]. Hypergraphs G and H are hypomorphic if there is a bijection ¢: V(G)—> V(H) 
such that G-— vu =H — $(v), for all ve V(G). G and H are said to be reconstructions of each 
other. In case G $ H, G and H are said to be non-reconstructible. We describe the construction 
of a family of non-reconstructible 3-hypergraphs with 2” + 2” vertices, for all n, m > 1. 


1. Introduction 


Let k >1 be an integer. Let V be any set with |V| =k. We denote by (;) the 
set of all k-subsets of V. A k-hypergraph G consists of a set V(G) of vertices and 
a set E(G) <(") of edges. If X < V(G), then the subhypergraph induced by X 
is G[X] = {e € E(G) | e € (Z)}. The vertex-deleted hypergraph G — u is defined to 
be G[V(G) — {u}], where ue V(G). 

Two k-hypergraphs G and H are isomorphic, denoted G = H, if there exists a 
bijection 6:V(G)— V(H) such that e € E(G) if and only if 6(e) € E(H). G and 
H are hypomorphic if there is a bijection ¢:V(G)—V(H) such that G-u= 
H — $(u), for each e € V(G). 

If k =2 the graph reconstruction conjecture [2] states that if |V(G)|=3, then if 
G and H are hypomorphic, then they are also isomorphic. Hypomorphic graphs 
are said to be reconstructions of each other. If G#H, then G and H are 
non-reconstructible. The corresponding question for k-hypergraphs is: if 
|V(G)| >k =3, then if G and H are hypomorphic k-hypergraphs, is G =H? In 
[2] it was shown that this statement can be false for 3-hypergraphs, and that it is 
most likely false for all k > 2. (A 2-hypergraph is a graph in the usual sense, with 
no repeated edges.) In [2] a family of non-isomorphic, hypomorphic pairs of 
3-hypergraphs with 2” + 1 vertices was constructed, for all n 1. In this note, we 
generalize the construction of [2], constructing a family of non-reconstructible 
3-hypergraphs with 2” + 2” vertices, for all values of n, m= 0. 

We consider only 3-hypergraphs from now on, so that the term ‘“‘hypergraph” 
will always refer to a 3-hypergraph. The construction to be presented requires the 
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hypergraphs G; ,(n) of [2]. We begin by defining them and stating several of their 
properties proved in [2]. 


2. The families Gj ,(n) and A{(n) 


Let n=O be an integer. Define V, = {1,2,...,2"}. Define two maps Poaa, 
Pevens £7 Z DY Poga(kK) =2k —1 and Peven(k) =2k. We consider all maps of 
integers to act also on objects composed of integers, so that Doaa(V,,) U 
Peven(V,) = Va+i, for any n =0. similarly, we shall allow poag and Peven to act on 
graphs and hypergraphs via their edge and vertex sets. If x, ye V,, we compute 
the sum x + y mod V, as the unique integer z € V, such that x + y =z (mod 2”). 

If G and H are hypergraphs such that E(G)M E(H) =9, we write G+ H for 
the hypergraph whose vertex-set is V(G) U V(H) and whose edge-set is E(G) U 
E(H). If G is a hypergraph with vertex set V,_,, we define a new hypergraph, 
denoted Eight(G), with vertex-set V,, by replacing each edge of G by an 
eightfold copy, as follows. Let N=2""'. If {x, y,z}eE(G), then Eight(G) 
contains the edges 


{x,y,z},  {x,y,z+N},  {x,y+N,z},  {x+N,y,z}, 
{x,y +N,z+N}, {x +N, y,z +N}, 
{x +N, y +N, z}, {x+N,y+N,z+N}, 


where the sums are computed mod V,,. 


Definition 2.1. Let 1<j<k<n22, and let ee {0,1}. We define a family 
G;,(n) of hypergraphs with vertex set V,,. : 
(1) If 2 =2, then 


E(G4,(2)) = {{1, 2, 3}, {2, 3, 4}} 
E(G},,(2)) = {{1, 2, 4}, (1, 3, 4}}. 

(2) If n =3 and (j, k) = (1, 3) then 
E(G4.3(3)) = {{x, x + 1, x +2}, (x, x +3, x +5} |x € Vs} 
E(G},3(3)) = {{x, x +1, x + 3}, {x,x+5,x+7}|x V3}, 
where the addition is computed mod V3. 

(3) If n=3 andk <n then 
Gj. An) = Doaa(G7.x(n — 1)) + Peven(Gj.x(n — 1)), 


i.e. make two copies of G;,(m— 1), one odd and one even, and take 
their union. 
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(4) Ifn 23, k=n, andj=n-—1, then 
E(Gr-1n(n)) = {{y, y +N, 4x +y—3} |yeV,, x€V,—3}, 
E(Gr-1niny = {{9 Yt N, 4x +y — I ly € Vn, x €Vp—s}, 
where N = 2” — 1 and addition is computed mod V,,. 

(5) Ifn 24, k=n, andj<n—1, then 
Gf.(n) = Eight(G;,,-1(n — 1)), 
i.e., make an eightfold copy of each edge of G;,,_,(n — 1). 


It is easily checked that this defines a unique hypergraph for each value of 
(e, j, k), so that n(n — 1) hypergraphs have been defined. We state without proof 
a number of properties of the G;,(n). 


2.2. Gj,(n) and G?,,(n) have no edges in common if (6, i, m) #(¢, j, k). 

2.3. Gi,(n) partition (7), where 1 <j <k <n =2. 

2.4. G?,(n) = Gj ,(n). 

2.5. Let 0,:V,—V, denote the map x>2"—x +1, where xeV,. If n=3 and 
j#1, then 6,(Gj,(n)) = Giz"(n). 

2.6. Let n 22. Then 0,(Gj,,(n)) = Gin(n). 


We also define a family Aj(n) of graphs (2-hypergraphs) with vertex set V,, for 
1<j<n=2 and ¢ € {0, 1}. 


Definition 2.7. 
(1) Ifn =2 andj =1, then 


E(Ai(2)) = {{1, 3}}, 

E(A\(2)) = {{2, 4}}. 
(2) If n 22 andj =n, then 

E(A,(n)) = {{4x — 3, 4y —2}, {4x — 1, 4y} |x, ye Vio} 

E(AM(n)) = {{4x —3, dy}, {4x — 1, 4y 2} |x, ye Va}. 
(3) If n=3 andj <n, then 

Aj(1) = Poaa(Aj(n — 1) + Aj(n — 1)), 

Aj(1) = Peven(Aj(n — 1) + Aj(n ~ 1). 

For example A$(3) and A3(3) are illustrated in Fig. 1. 


The graphs Aj(n) also satisfy a number of properties which are stated here 
without proof. 


216 W.L. Kocay, Z.M. Lui 
1 2 3 4 1 42 3 
6 5 8 7 °% 57 6 
1 
AS (3) A3 (3) 
Fig. 1. 


2.8. A‘(n) and A?(n) have no edges in common if (6, i) #(¢, j). 
2.9. Af(n)) partition (7), where 1<j <n >2, and € € {0, 1}. 
2.10. If j<n, then 0,(A‘(n)) = A}~“(n). 

211. 6,(Ag(n)) = Af(n). 

2.12. Aj(n) = A;~*(n). 


We now introduce two sets of indeterminates {u;|ieV,} and {v,|ieV,}, 
using the symbols u and v, subscripted by elements of V,,. Denote these two sets 
by V,(u) and V,(v). We shall define families of hypergraphs with vertex-sets 
V,(u) and V,,(v), where n,m20. Write Gj,(n;u) for the hypergraph formed 
from Gj,(n) by replacing each vertex / € V, with u;€V,(u). Gi.(n; vu), Aj(n;u), 
and Aj(n;v) are defined similarly. Call these two sets of hypergraphs u- 
hypergraphs and v-hypergraphs, according as to whether the vertex-set is V,,(u) 
or V,(v). 

If A denotes a set of unordered pairs (e.g. the edges of a graph) and u is a 
vertex, then uA denotes the set of all triples formed by adding u to each pair of 
A, so that uA is the edge set of a hypergraph. 


Definition 2.13. We define hypergraphs M‘(n, m) and Ni(n, m), with vertex-set 
V,(u) U V,,(v), where 1 <i<n and 1<j<m. 
(1) If =1 and m=2, then: 


E(Mj(n, m)) = {ur E(A;(m; v)) | k €V,-1}, 

E(M\(n, m)) = {u2E(A'(m; v)) | k € Vai}; 
(2) If n=2 and m= 1, then: 

E(Ni(n, m))= {Ux-1E(A?(n; u)) | keV,,-1}, 

E(Ni(n, m)) = {v2 E(Ai(n; u)) |k € Vin}; 
(3) If n=0 and m = 2, then: 

E(M7(0, m)) =u, E(A;(m; v)), 

E(M}(0, m)) = u,E(Aj(m; v)); 
(4) If 2 =2 and m =0, then: 

E(N((n, 0)) =v, E(Ar(n; «)), 

E(N;(n, 0)) = v1, E(Ai(n; u)). 
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Each hypergraph G;,(n;u) or Gj,(m;v) has vertex-set V,(u) or V,,(v), 
respectively. They are called pure hypergraphs. Each edge of M;(n, m) contains 
one vertex of V,(u) and two vertices of V,,(v). Each edge of Nj(n, m) contains 
one vertex of V,,(uv) and two of V,(u). They are called mixed hypergraphs. 

If G is a hypergraph and u € V(G), the degree of u is deg(u, G), the number of 
edges of G containing u. The following results are easily verified from the 
definition or else can be found in [2]. 


2.14. LetieV,, n=3, and (j, k) #(1, 2). Then deg(i, Gj,(n)) =3 - 27%! *4, 
2.15. Letie V,, n2=3. Then 
ite, ifi<2" 2 ori>3-2", 


deg(i, Gi2(n)) = {3 —e, if2"?<i<3-2"-2, 


2.16. Let 1<j<n and ieV,, where n=2. Then |E(Aj(n))|=2"*/-? and 
deg(i, Af(n)) = 2°. 


By forming hypergraphs of the form Gf,(n;u)+M?(n, m)+ Nji(n, m) + 
Gi,(m;v), we shall construct families of non-reconstructible hypergraphs with 
vertex-set V,(u) UV,,(v). We first need some information about the isomorph- 
isms between the vertex-deleted hypergraphs. 


3. The mappings P,, 


Definition 3.1. Let n= 1 and let i€ V,. We define (see [2]) a family of mappings 
Pit Va EV, 0. 

(1) If 2 =1 then P, ,(2) =2 and P, (1) =1. 

(2) If i=2j,;#xeEV,_,, andn=2, then 


P,, (2x — 1) =2"-2x +1, 
P,, (2x) = 2P,_-1,;(x). 
(3) Ifi=2j-1,f#xeV,_,, and n=2, then 
P,, (2x — 1) =2P,_1,,(x)—1, 
P,, (2x) = 2" — 2x +2. 


Proofs of the following properties can be found in [2]. 


3.2. Let n=2 and j #1. Then P, ,(G¢,(n) — i) = Gj;"(n) — i, where ie V,,. 
3.3. Let 1<k<n2=2. Then P, (G{,(n) —i) =Gi,(n)—i, where ie V,. 
3.4. Let n=2. Then P, ,(Af(n) — i) = A}, ‘(n) — i, where ie V,,. 

3.5. Let 1<j<n 22. Then P, ,(A;(n) — i) = Aj(n) —i, where ie V,. 

3.6. P,, maps odd numbers to odd and even to even. 
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The significance of these lemmas is the following. Associated with each 
hypergraph G;,(n) is a hypomorph Hypo(G;,(n)), and associated with each 
graph A;(n) is a hypomorph Hypo(A;(n)): 

Hypo(Gj.(n)) = Gjx(n), if j #1; 

Hypo(G{.(n))= Gi.(n), iff=1; 

Hypo(Aj(n)) = Aj(n), if j<n, 

Hypo(Ai(n)) =A, ‘(n), if j=n. 
In all cases P,(G),(n)—i)= Hypo(Gj.(n))—i, and P, (Aj(n)—-i)= 
Hypo(A;(n)) — i, so that the same mappings P,; map a vertex deleted graph or 
hypergraph to the vertex-deleted graph or hypergraph of its hypomorph. Because 
of this, we are free to take any combination of these hypergraphs and the resulting 
hypergraph will still have a well-defined hypomorph for which the mappings P,, ; 
will map the vertex-deleted hypergraphs to the corresponding vertex-deleted 
hypergraphs of the hypomorph. 

We extend 6, to act on V,(u) and V,(v) by writing 0,(u;)=Ue) and 
9,,(U;) = Ve, (0): 


Definition 3.7. We also extend 6, and the P,,, to act on the set V,(u) U V,,(v) as 
follows: Let ie V,, and j € V,,. Then 
(1) Oui) = 6,(u;) and 67'(u;) = 6,,(u,); 
(2) PR«v;) =6,,(v;) forall v,eV,,(v), 
Pr(uj=u, if P,dk)=t; 
(3) Pr(u;)=98,(u;) forall u;,e V,(u). 
P@(u,)=v, if Pak) =; 
i.e. the superscript m refers to V,,(v) and the subscript n to V,(u). 


In this way, the P”, and P?"' will also be isomorphisms of the vertex-deleted 
hypergraphs of Gj,(n;u) and G;,(n; uv) with those of their hypomorphs. 


Lemma 3.8. Let 1<j< mand 1<i<n. Then: 


On(Mj(n, m)) = Mj(n, m), 
On(M n(n, m)) = Mr “(n, m), 
O™(Ni(n, m))=Ni(n, m), and 
On(N,(n, m)) = Ny “(n, m). 


Proof. Consider Mj(n, m). By 2.13, the edges of Mj(n, m) containing any u,_, 
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form a copy of A/(m;v), and the edges containing any uy form a copy of 
Aj(m; v). But 67(Aj(m; v)) = Aj “(m; v) by 2.10 and 3.7. Furthermore, 6, maps 
odd to even and even to odd, so that 07(M;(n, m))=Mj(n, m). The other 
results follow similarly, using 2.10, 2.11, and 2.13. O 

(Note: This result does hold when n =0 or m = 0 is allowed.) 


Lemma 3.9. Let ieV,, kEV,», and 1<i<m. Then P7A(Mi(n, m)—u)= 
M}-*(n, m) —u;, and P7”*(Mi(n, m) — v,) = Mj-*(n, m) — Ux. 


Proof. Consider P7(Mj(n, m)—u;), where ie V,. P7; maps each v, to 6,,(v,). 
By 2.13 the edges of M*(n, m) containing any u, from a copy of Af(m;v) or 
Aj(m; v), and 6,,(Aj(m; v)) = Aj-*(m; v), by 2.10. Furthermore, P,; maps odd 
to odd and even to even, by 3.7, so that by considering the Definition 2.13, it is 
easy to see that P”,(Mé(n, m) — u;) = Mj" “(n, m) — u;. 

Consider now P7*(Mi(n,m)—v,). If ieV, is odd, then the edges of 
Mi(n, m) — v, containing u; define a copy of Af(m;v)—u,. Pm* will map u; to 
6,(u;). By 3.5, Pn (Af(m)—k)=Aj(m)—k, so that Pr*(Af(m; v) —u,)= 
Aj;(m; v) — vz. But since i is odd, 6,(#) is even, so that in Mj(n, m)—v,, the 
edges containing 6,(u;) from a copy of Aj(m; v) — u,, i.e. in P7*(MF(n, m) — ux) 
the edges containing each u; are the same as in M}~‘(n, m) — v,. If i is even, then 
6,(i) is odd and the analysis is similar, so that P?(M;(n, m)—u;)=" 
M}-“(n,m)-u, O 


Lemma 3.10. Let ieV,, keV,,, and 1<j<n. Then P™(N;(n, m) —u;)= 
Nj *(n, m) — u;, and P™*(Nj(n, m) — vx) =Nj—“(n, m) — vg. 


Proof. The proof is like that of 3.9, except that the roles of u and v are 
reversed. O 


Lemma 3.11. Let ie V, and ke V,,. Then P™(M=(n, m) —u;)=M;,(n, m)—- 
and P™*(ME(n, m) — Ux) = ME(n, m) — vg. 


Proof. The proof is very much like that of 3.9. Consider P?(M;,(n, m) — u;), 
where ie€V,,. P™, maps each vu, to 6,,(v,). By 2.13 the edges of M;,(n, m) 
containing any u, form a copy of A?,(m;v) or A},(m;v), and 6,,(Af,(m; v)) = 

At(m;v), by 2.11. Furthermore, P,; maps odd to odd and even to even, by 3.7, 
so that by eco tas the Definition 2.13, it is easy to see that P77 (Mi(n, m) — 
u;) = M;7,(n, m) - 

Consider now ‘pm *(ME(n, m)—v,). If ieV, is odd, then the edges of 

M£(n, m) — u, containing u; define a copy of A?,(m; uv) — vz. P™* will map u; to 
6,,(u;). By 3.4, Pnx(Ai(m) —k)=A};(m)—k, so that P™*(AL(m; uv) —u,) = 
Al *(m; v) — v,. But since i is odd, 6,,(i) is even, so that in Mj,(n, m) — v,, the 
edges containing 6,,(u;) form a copy of A},(m; v) — vz, ie. in P7*(ME(n, m) — 
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v,) the edges containing each u, are the same as in M7,(n, m) — v,. If i ie even, 
then 9@,(i) is odd and the analysis is similar, so that P7,(M,,(n, m)—u,;)= 
Mf(n,m)-u, O 


Lemma 3.12. Let ic V, and keV,,. Then P™(Nf(n, m)—u;)=N;,(n, m)— u;, 
and P™*(N£(n, m)— ux) = Ni(n, m) — vx. 


Proof. The proof is like that of 3.11, except that the roles of u and v are 
reversed. O 


Lemmas 3.9 to 3.12 show that we can define hypomorphs for the Mj(n, m) and 
Ni(n, m): 


Hypo(M;(n, m))=M;~“(n, m), if j<m, 

Hypo(M,,(n, m))=M,,(n,m), if j=m, 

Hypo(Ni(n, m))=N;-‘(n,m), ifi<n; 

Hypo(Ni(n, m))=N,(n,m), ifi=n. 
In each case the hypomorphism is the identity mapping from V,(u) U V,,(v) to 
itself, and the mappings P”,; and P7'* are the isomorphisms from the vertex- 
deleted hypergraphs to those of the hypomorph. It is clear that the same 


mappings are also isomorphisms from the vertex-deleted hypergraphs of 
G;,(n;u) and Gj ,(m; v) to those of their hypomorphs, too. 


4. Non-reconstructible families 


In this section we put the preceding results together and construct a family of 
non-reconstructible hypergraphs with vertex-set V,(u)UV,,(v), giving 2” +2” 
vertices in total. We first summarize the results of Section 3 in a table for handy 
reference. 


Definition 4.1. Let n =2. Then G§ denotes the hypergraph ig_.Gj,(n). Gi(u) 
and G;(v) are defined correspondingly. 


The automorphism group of a hypergraph G is Aut(G), the group consisting of 
all permutations of V(G) which are isomorphisms of G with itself. The following 
lemma is proved in [2]. 


Lemma 4.2. Let n= 3. Then Aut(G‘)= (6,). 
Theorem 4.3. Let n,m2>3, n#m, and 1<j<m. Let X},m=G,(u)+ 


Mé(n, m) + G5(v), and let YF 4m = HypO(X5 nm) = Gi(u) + Mj “(n, m) + GE(v). 
Then Xj.n.m $ Yiinm 
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Proof. By 2.14 and 2.15 deg(u,, Gé(u)) is either 2%-? or 27"-3-1, and 
jeg(v,,G5(v)) is either 277"? or 27"-°-1. By 2.13 and 2.16, 
jeg(u,, Mj(n, m))=27*/~? and deg(v,, Mf(n, m)) = 2"*?. So deg(u,, Xj nm) = 
27-3 § +2747 and deg(uy, Xfnm)=27" >-€+2"1-3, where 6 and € are 
either 0 or 1. Since j<m and n #™m, it follows that vertices of V,(u) and V,,(v) 
are distinguished by their degree, so that if W: Xjnm—> Yjn,m iS any putative 
isomorphism, y must map V,(u) to V,,(u) and V,,(v) to V,,(v). Since each edge of 
G;(u) contains only vertices of V,(u), whereas edges of Mj(n, m) are mixed, it 
follows that y(G;(u)) = Gi(u), or wy € Aut(G*(u)). The only possible choice is y 
acts as 8, on V,(u), by 4.2. Similarly, y must act as 6,, on V,,(v), so that y = 0” 
is the only possibility. But 07'°(Mj(n, m)) = Mj;(n, m), by Table 1, from which it 
follows that X¢ p.m # Yin.m- 


For example, X2.3,. is illustrated in Fig. 2. Hypergraphs are not so easily drawn 
on paper as graphs are, where their structure is immediately visible. We have 
used the following technique of [1] to represent a hypergraph graphically. If G is 
any hypergraph and u e V(G), write A, for the set of edges containing vertex u. 
If we now remove u from each edge of A,, we get the edge set of a graph Aj,. So 
G defines a set {Aj,| u € V(G)} of graphs. Following [1], we say that G subsumes 
the graphs {Aj,| u €V(G)}. A hypergraph G is uniquely determined by its set of 
subsumed graphs, which provides a convenient visual representation of a 
hypergraph. In Fig. 2 we have distinguished the u; and v; by drawing the v; as 
dark dots. We then omitted the u or v, writing only i for u; or v,; however, the v; 
are in a smaller type to make it easier to distinguish them. 

The main result of [2] was the construction of a family of non-reconstructible 
hypergraphs with 2n + 1 vertices. In the present notation, the result can be stated 
as follows. 


4.4, Let n>2 and m =0. Then X, = G9(u) + Na(n, 0) and Y, = Gé(u) + Ni(n, 0) 
are hypomorphic but non-isomorphic. 


This is the reason that the definition of Mj(n, m) and N;(n, m) in 2.13 allowed 
n=0 or m=0. Most of the lemmas of Section 3 dealing with the Mj(n, m) and 
N;(n, m) do not hold as stated if m=0 or n=0 is to be allowed, but require 
special cases. Since in this paper we are mainly concerned with m, n #0, we do 


Table 1. 

Xx Hypo(x) O7(X) Lemma Remarks 
Gin; u) Ginyu) Gig (nyu) 2.5 j>1 
Gixlnsu) Ginsu) Ginsu) 2.6 j=l 
M;(n, m) M}-‘(n,m) = M¥(n, m) 3.9 j<m 
M,(1, m) M,,(n, m) M,, (n,m) 3,11 j=m 
N;j(n, m) N!-*(n, m) Ni(n, m) 3,10 i<n 


NE(n, m) N£(n, m) N'~*(n,m) (3.12 i=n 
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Fig. 2. X94. 


not include these special cases, except for this paragraph of explanation, and 4.4 
above. 

Theorem 4.3 only allows n #m. If n =m, we cannot use the degree argument 
of 4.3 to force p(V,(u)) = V,(u) and p(V,,(v)) = V,,(v). The easiest way to allow 
n= is the following. 


Theorem 4.5. Let n=m2>3 and 1<j<n. Let W;,=G,(u)+ Gis(niu)+ 
Mi(n,n)+Gi(v) and U;,,= Hypo(W;,,)=G,(u) + G23"(n;u) + Mj “(n, n) + 
Gi(v). Then W;,, # U*,,. 


gues 


Proof. By 2.14 and 2.15, deg(u,,Gi(u)) is either or 277-4-1, 
and deg(u,, Gi.3(n;u))=3. By 2.13 and 2.16, deg(u,, Mj(n,n))= 
deg(u,, Mi(n,n))=2"*'-3. So deg(u,, WS,) >deg(v,, Wi,,), from which it 
follows that any putative isomorphism y: W;,,— U;,, must map V,(u) to V,(u) 
and V,(v) to V,(v). By 4.2, w must act as 0, on V,(v), since only edges of G7(v) 
contain only vertices of V,(uv). Since Mj(n, n) is a mixed hypergraph, whereas 
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G<(u) + G§,3(n; u) is pure, it also follows that y(Mj(n, n)) = M}~*(n, n) and that 
w(Gi(u) + G§,3(n; u)) = Gi(u) + G35"(n; u). By Table 1, 6% maps Gé(u)+ 
G5,3(n;u) to its hypomorph, G{(u) + G33'(n; u), so that the set of all such 
mappings (which includes w), is the coset 67 Aut(G;,(u) + G3,3(n; u)). However, 
wy cannot equal 67, since 67(M;(n, n)) = M;(n, n)), By Table 1. 

Now Aut(G{) = (6,), by 4.2. Let @ € Aut(Gé + G§,3(n)). If all such @ are also 
in Aut(G;), then we would need to have y = 07, which is impossible. So if there 
is an isomorphism wy, it must act on V,(u) as 6,@ on V,, where @ ¢ Aut(G‘). 
Such a @ must map some edge of G33(n) to an edge of G7. By 2.1 and 4.1 it is 
easy to see that Gy =posa(Gr_-1) + Peven(Gr_-1) + Gr_i,,(n). Furthermore, 
G5,3(n) = Poaa(G5,3(2 — 1)) + Peven(G3,3(n —1)), if n 24, so that all edges of 
G5,3(n) contain only odd or only even vertices, and in G;,+ G5,3(n), only edges 
of Gi_1,,(n) contain both odd and even vertices. 

Now by 2.10, 0,(Aj(n)) =A} ‘(n). If we compare this with 2.13, keeping in 
mind that w acts as 8, on V,(v), we see that y must map Doaga(V,-1(u)) to 
Poaa(Vn -1(4)) and Peven(Vn—1(4)) to Peven(V,-1()) if we are to have 
y(Mi(n, n))=M}"*(n, n). Comparing this with 0,¢ we see that @ must 
interchange odd and even vertices. It now follows from the previous paragraph 
that @ maps Poga(Gr_-1 + G5,3(n —1)) tO Peven(Gr_1 + Gi,3(nm —1)), so that we 
have reduced the action of @ to an automorphism of G;_, + G5,3(n — 1), where 
n=4, To finish the proof (by induction), it is enough to show that G5 + G3, 3(3) 
has no automorphisms other than the identity. This is most easily seen from Fig. 
3, where G3 + G5,3(3) is shown in terms of its subsumed graphs. 0 


So W;,, and U;,, from another family of non-reconstructible hypergraphs. We have 
only considered m, n23. There are also many pairs of non-reconstructible 
hypergraphs with m or n equal to 2, not indicated by the above theorems. 
For example, G$.,(2, u) + M9(2, 2) + Mi(2, 2) + N9(2, 2) + Gi,(2, v) is non- 


Fig. 3. G5 + G5.3(3). 
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reconstructible, as is G{,,(2, u)+Mj(2, 2) + M3(2, 2) + Ni(2, 2) + N92, 2) + 
G‘2(2, v). 

Finally, we should like to mention that it would have been virtually impossible 
to develop these results without the use of a computer. We used the University of 
Manitoba’s Amdahl 5850 to construct the hypergraphs and test for isomorphism. 
Isomorphism was tested using B.D. McKay’s program GLABC of [3]. 
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1. Introduction 


Let G be a simple graph with vertex set V(G) and edge set E(G). For each a in 
V(G), let N(a) = {x € V(G):ax € E(G)} be the set of neighbours of a. A subset 
S of V(G) is called a closed set of G if, for each pair of distinct elements a, b in S, 
N(a)N N(b) cS. Let £(G) be the family of closed sets of G, inclusive of the 
empty set 6. Evidently, £(G) is closed under arbitrary intersection and it thus 
forms a lattice under set-inclusion (see Fig. 1). The lattice £(G), which was first 
introduced by Sauer (see [6] and also [3, 4, 5]), is called the closed-set lattice of 
the graph G. 

We shall now introduce, in terms of their closed-set lattices, various classes of 
graphs. A graph G is said to be minimally critical if £(G) # L(G — e) for each e 
in E(G), and maximally critical if £(G) # L(G + e) for any e in E(G), where G 
is the complement of G. We say that G is critical if G is both maximally and 
minimally critical. A graph G is said to be sensitive if for any graph G’ that 
£(G)=£(G') implies G=G'. Suppose that G and G’ are graphs such that 
L(G) = £(G') under a lattice isomorphism @. It is easily seen that ® induces 
naturally a bijection ¢: V(G)— V(G’) such that for each x in V(G), $(x) =x’ in 
V(G’) if and only if P({x}) = {x’} in L(G’). We call @ the bijection induced by 
®. A graph G is said to be strongly sensitive if for any graph G’ and for any 
lattice isomorphism ®: £(G) = L(G’), the bijection @ induced by @ is a graph 
isomorphism of G onto G’. 

For a graph G, the following implications follow immediately from definitions: 

G is strongly sensitive 
> G is sensitive 
> G is critical 
SGis { maximally cateal 
minimally critical. 

While strongly sensitive graphs are abundant (see [3]), not every graph is 

maximally critical or minimally critical. There exist maximally critical graphs 
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L(G) : 


Fig. 1. 


which are not minimally critical and vice versa. A class of critical graphs which 
are not sensitive can also be found in [4]. Now the question is: Are there sensitive 
graphs which are not strongly sensitive? 

It can be verified that the graph of Fig. 1 is the unique graph of minimum order 
which is sensitive but not strongly sensitive. Two more such graphs of order 6 are 
shown in Fig. 2. As a matter of fact, it is our main aim in this paper to introduce 
some methods of construction which produce various families of sensitive graphs 
that are not strongly sensitive. 

Throughout this paper, all graphs are assumed to be finite, nontrivial and 
simple. For all terminology on graphs and lattices not explained here, we refer to 
[1] and [2] respectively. 


2. Preliminaries 
We begin with the following observation: 
(*) Let G be a graph of order at least three. Then G is disconnected if and only if 


£(G) is direct product decomposable. Thus for graphs G and G’ of order at least 
three such that £(G) = £(G’), G is connected if and only if G’ is connected (see 


Fig. 2. 
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[6]). In contrast with this, we have £(P,) = £(O2), where P, is the path of order 2 
and O, is the graph consisting of two isolated vertices. 

For a subset X of V(G), the subgraph of G induced by X is denoted by [X] and 
the closed set of G generated by X is denoted by (X). For graphs G and G’, we 


shall write £(G)'= £(G’) to indicate that lattices ¥(G) and ¥(G’) are 
isomorphic under the lattice isomorphism ®. The image of X under a mapping @ 
is denoted by X@. 


Lemma 1 [3]. Let G and G' be graphs such that £G6)2 £(G') and let 
o:V(G)—>V(G’) be the bijection induced by ®. For any subset X of V(G), any 
closed set A of G and any vertex a in G, we have: 

(i) ®((X)) = (X¢), 

(ii) a € (X) if and only if o(a)€ (Xo), 

(iii) |(X)|=1(X@)I, 

(iv) (A) =A@, 

(v) degg(a) =1 if and only if deg;;(o(a))=1, provided that G is connected 

and |V(G)| 23, 


(*) 


(vi) L(X)]) = L([(X¢@)]), where S* is P confined to L([(X)]). 


The following result provides a simpler way to determine whether two 
closed-set lattices are isomorphic. 


Lemma 2 [3]. Let G and G’ be two graphs. Then £(G)=£(G’) if and only if 
there exists a bijection a:V(G)— V(G') such that 


(({x, y}))a = ({a(x), a(y)}) 


for any pair of distinct elements x, y in V(G). 


Remark. Given a@ satisfying the above condition, define ®: 2(G)— L(G’) by 
@(S) = Sa. It can be shown that @ is a lattice isomorphism and the bijection @ 
induced by @ is identical with a. 


3. Rambutans 


For any connected graph G of order n, denote by B(G) the graph of order 2n 
obtained by adjoining to each vertex of G a new end vertex. Every graph of the 
form B(G) is called a rambutan (also known as corona). It was noted in [4] that 
every rambutan is critical. It is easy to prove that for any complete graph K,, 
(n= 2), B(K,,) is strongly sensitive. While a class of nonsensitive rambutans was 
provided in [4], we construct here a class of sensitive rambutans which are not 
strongly sensitive. , 
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B(G(5,3)) : 


Theorem 1. For each complete graph K, of order n>=4 with V(K,)= 
{a,@,...,a,} and for each integer r such that2<r<n—2, let G(n,r) bea 
spanning subgraph of K,, with 

E(G(n, r)) = E(K,,) — {a,a;:1<i<j <r}. 


Then the rambutan B(G(n, r)) is sensitive but not strongly sensitive. 


Proof. Let {b,,...,5,} be the set of end vertices of B(G(n,r)) such that 
a,b; € E(B(G(n,r))) for each i=1,...,n". Let H be any graph such that 


£(B(G(n, r))) 2 L(A). Let V(H) = {x' = o(x):x € V(B(G(n, r)))}. By obser- 
vation (*), H is connected. By Lemma 1(v), {b;,...,5,} forms the set of end 
vertices of H. 

Since | {b;, b;})| =2 for all i,j =1,...,n with i#j, we have |({b;, bj})| =2. 
Thus d(b;,6;)=3 for all i,j=1,...,n with i#j. Hence H= 
B([{ai,..-, a,})). 


Claim 1. ajb; « E(H) for eachi=r+1,...,2. 

If a;b; ¢ E(H) for some i=r+1,...,n, then there exists j=1,...,” with 
j#i such that ajb;e E(H). But then |({a;, b/})|=2 while |({a,, b,})| =3, 
contradicting Lemma I (iii). 


Claim 2. aja; ¢ E(H) for each i=r+1,...,n and for each j=1,...,n with 
L#j. 

Since |({b;, a;})|=3, we have |({b;, a;})|=3 by Lemma I(iii). Hence by 
Claim 1, aja; € E(H). 
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Claim 3. a;a; ¢ E(H) for all i, j=1,..., r with i #j. 

Assume on the contrary that aja; € E(H) for some i, j=1,..., 7 with i #j. 
There exists ak =1,..., r such that b,a; € E(H). But then |({b;, a;} )| =3 while 
| {b,, a;)})| =2, contradicting Lemma 1(iii). 


It thus follows that there is a permutation y of {1,2,...,7r} such that 
4;4.,; € E(H) for each i=1,..., r. Note that we have B(G(n, r)) = H whatever 
py is, and by Lemma 2, £(B(G(n, r))) ‘2 £(H). Hence B(G(n, r)) is sensitive 
but not strongly sensitive as y may not be the identity mapping. O 


4. A construction based on strongly sensitive graphs 


There are strongly sensitive graphs in plenty. Indeed, it was proved in [3] that 
evey C,-free graph and the covering graph of every lattice are strongly sensitive. 
In this section, we shall introduce a method to construct sensitive graphs which 
are not strongly sensitive from a family of strongly sensitive graphs. 

Again, let K, be the complete graph of order n>4 with V(K,)= 
{a,,@2,...,4,}. For each integer r such that 2<r<n—2, let G(n, r) be the 
graph constructed in Section 3. Let = {G,41, G-+2,..., G,} be a given family 
of (n —r) connected strongly sensitive graphs (thus each G; is of order at least 
three). We construct a new graph G7(Y) based on G(n, r) and G as follows: 


V(G"(9) =V(G(n, r)) U A V(G) U {b;, b, ..., b,} 
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where {b,, b2,..., 6,} is a set of r new vertices and for eachi=r+1,...,n, a; 
is identified with exactly one vertex of G, and 


E(G7($)) = E(G(n, r)) U {a;b;:i=1,2,...,r}U U E(G,). 


j=r+l1 


Then we have 
Theorem 2. Every graph G?(&) is sensitive but not strongly sensitive. 


Remark. Graphs of the form B(G(n,r)) considered in Theorem 1 are not 
contained in the family of graphs shown in Theorem 2 since every edge 
considered as a graph of order 2 is not strongly sensitive. 


Proof. Let H be any graph such that £(G"())‘© L(H). Let V(H)={x' = 
o(x):x €V(G7(9))}. Again, H must be connected. By Lemma _ l(v), 
{bj,...,5;} is a set of end vertices of H. Hence d(b;, b;)=3 for all 
i,j=1,...,r with i#j. We shall now establish a series of claims: 


(1) For each i=r+1,...,n and for all x, y €V(G,), xy € E(G,) if and only if 
x'y'e€ E(A). 


(®,) 
By Lemma 1(vi), £(G;) ~ £({(V(G,))]), where 9, = ®|.4.<,). Since each G; 
is strongly sensitive, (1) thus follows. 


(2) Foralli=r+1,...,n,j=1,...,rand for every xe V(G;), bjx' ¢ E(H). 


Assume on the contrary that b;x' € E(H). Since G; is connected and |V(G,)| > 3, 
there exists y in G; such that xy e E(G,). By (1), x'y’e E(H). Hence x’, y'e 
({b;, y'}), and so x, ye ({b,, y}) by Lemma 1(ii). However, this contradicts 
the fact that |({b,, y}) NV(G,)| =1. 


Now, by (2) plus the fact that H is connected and {b;,..., b;} is a set of end 
vertices with d(b;, b;) =3 for each pair of distinct i,j =1,..., 7, we have: 

(3) There exists a permutation y of {1,2,...,7} such that a/b,,,;,€ E(H) for 
eachi=1,....9r. 


(4) For all i, j=1,...,r with i #j, aja’ ¢ E(H). 


If aja; €E(H), then |({a;, by,,)})|=3 but |({a,, by.)})|=2, contradicting 
Lemma lL (iii). 


(S) For alli=r+1,...,nandj=1,...,7r, a/aj eE(H). 
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Since |( {a;, byy})| = 3, we have |( {a/, by,,,)})| =3. By (3) and the fact that 
deg(b4,;)) ai 1, aja; € E(#H). 


(6) For alli=r+1,...,n, j=1,...,n with 1 #j and for every x e N(a,)M 
V(G;), ajx' ¢ E(H). 


Assume on the contrary that ax’ e E(H). By (1), ax €E(G;,) implies that 
a/x' € E(H). Thus x’ € ({a;, a;}), contradicting the fact that x ¢ ({a,, a;}). 


(7) For all i,j=r+1,...,n with i#j and for all xe V(G;)—{a;} and 
yeV(G), x'y’ ¢ E(H). 


Suppose x'y’ € E(H). By (6), we need only to consider the following two cases: 
(i) x €V(G;) — (N(a) U {a,}) and y = a, and (ii) x € V(G,) — {a)} and y € V(G,) - 
{a;}. In case (i), we have aja; € E(H) by (5). Hence aj € ( {x’, a;}), contradicting 
the fact that a, ¢({x, a,}) = {x, a,}. In case (ii), we have deg(x)>1 since 
deg(x')>1. Let ue N(x)— {a;}. By (1), u’x’€ E(H). Hence x'e ({u’, y’}), 
contradicting the fact that x ¢ ({u, y}) = {u, y}. 


(8) For alli, j=r+1,...,n with: #j, aja’ € E(H). 


Assume on the contrary that aja; ¢ E(H). Let x e N(a;)O V(G,). By (6) and 
(7), N(x')<(V(G,))o. Again by (7), N(a’)<V(H)—(V(G;))¢. Thus we have 
N(x')ON(a/)=8. Hence |{{x’, a/})|=2, contradicting the fact that 
I {x, a;})| 23. 


(9) For all i=r+1,...,n, j=1,...,r and for every xeV(G,) — {a}, 
ajx' ¢ E(H). 
If a/x' € E(H), then a; € ({a/, x'}), contradicting the fact that a; ¢ ({a;, x}). 


It thus follows from what we have discussed that G7(9) =H whatever y is, 


without violating the assumption that £(G"(@)) ‘= £(H) (see Lemma 2). Since y 
is not necessarily the identity mapping, we conclude that G?(Q) is sensitive but 
not strongly sensitive. O 


5. Graphs without cut vertices 


All the graphs constructed in the preceding two sections contain end vertices 
and hence cut vertices. Does there exist a sensitive but not strongly sensitive 
graph which contains no cut vertices? The answer is ‘yes’. Fig. 5 shows the two 
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Fig. 5. 


such graphs of minimum order. In this section, we shall furnish a class of such 
graphs. For each vertex x in G, we shall denote by N[x] the closed neighbour- 
hood of x, i.e. N[x] = N(x) U {x}. 


Theorem 3. Let G be a graph as shown in Fig. 6 where 

(i) each G; (i = 1, 2, 3) is a strongly sensitive graph, 

(ii) a,€ V(G,) — N{a], 6, € V(G,) — N{b] and c, € V(G;) — N{c]. 
Then G is sensitive but not strongly sensitive. 
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Proof. Let H be any graph such that £(G) 2 L£(H). Let V(A) = {x':x €V(G)}. 
Clearly, H is connected. Again, we shall establish a series of claims. 


(1) For each i=1,2,3 and for all x, yeV(G,), xyeE(G) if and only if 
x'y'e€ E(A). 

This follows immediately from Lemma 1(vi) and the fact that each G; is 
strongly sensitive. 


(2) For all i=1, 2, j=1, 2, 3 and for every x Ee V(G)), fix' ¢ E(H). 


It suffices to consider the case when i=j=1. Assume on the contrary that 
fix'€ E(H). Since G, is connected and |V(G,)|>3, there exists ye V(G,)N 
N(x). By (1), x’y’ € E(H). Thus x’, y'e ({f1, y'}), contradicting the fact that 
{fi yd) VG) = 1. 


(3) N(fi) = {di, ex} and N(f2) = {d;, e,} where {i, j} = {k, r} = {1, 2}. 


Since deg(f;) #1 for each i=1, 2, we have deg(f;) #1 by Lemma 1(v). Hence 
deg(f;)2=2 for each i=1,2. Since ({fi,f}) ={fi, fh}, we have ({f7, f3}) = 
{fi, f3} and hence N(f;) N(f2) =9. Moreover, fi f2¢ E(H), for otherwise by 
letting u'eN(fi)—{f2}, we have u'e€{dj,d3,e1,e5} by (2), and hence 
|({u’, f5})|=3, contradicting the fact that |({u, f£})|=2. Thus M(f,) U N(f2) = 
(di, d3, e1, €5} and deg(f;) =deg(f)=2. Since ({dy, d2}) = {di, dz, a, b}, we 
have ({d1, d5}) = {dj, d3, a’, b'} and hence we conclude that: (i) dif; and df; 
cannot be both in E(H) and (ii) d;f3 and d3f3 cannot be both in E(H). Similarly, 
by considering ({e1, €2}) = {€1, €2, 6, c}, we have: (iii) e,f; and e2f; cannot be 
both in E(H) and (iv) e;f3 and ef; cannot be both in E(H). Thus (3) is 
established. 


(4) For every pair of distinct vertices x, y in {d,, dz, e1, €2}, x'y’ ¢ E(H). 


Since |({x, f})|=]({y, f})|=2 for each i=1,2, we have |({x’, fi})|= 
\({y', fi})| =2 for each i = 1, 2. Now (4) follows from (3). 


(5) For each i= 1, 2, a’e;, c’dj ¢ E(H). 

Assume on the contrary that a’e; € E(H) for some i=1, 2. Let x EV(G,)N 
N(a). Then ax € E(H) implies that a'x'€ E(H) by (1). Thus |({x’, e/})|=3, 
contradicting the fact that |({x, e;})|=2. Hence a’e’ ¢ E(H). By symmetry, we 


have c’d/ ¢ E(H) for each i= 1, 2. 


(6) a'd/, b'dj, b’e;, c'e;, a’'b’, b'c' € E(H) for each i=1, 2. 
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Fig. 7. 


Since L({{a, b, d,, d}]}) = L({{b, c, e1, e2}]) = Ma (see Fig. 7), we have 
L({{a', b’, di, d3})) = L([{b', c', e1, €2}]) = Ma. As dids, ees ¢ E(H) by (4), (6) 


thus follows. 

(7) aibj, aici, bic, € E(H). 

Observe that [({a,, 51, c1})] =[{a1, 61, c1}] = C3, which is a strongly sensitive 
graph. By Lemma l(vi), we have [({aj, bi, ci})}=[{ai, bt, ci}] = Cs, 


establishing (7). 


(8) For each i= 1,2 and for every x €(V(G,) — {a}) U(V(G,) — {b}), dix'¢ 
E(#). 

This follows immediately from the fact that (V(G;))@ is a closed set of H for 
each i = 1, 2. 


Similarly, we have 


(9) For each i= 1,2 and for every x € (V(G,) — {b}) U(V(Gs) — {c}), ejx' ¢ 
E(H). 


(10) For each i = 1, 2 and for every x € V(G;) — {c}, djx' ¢ E(H). 


Since ({a, x}) = {a,x}, we have ({a’, x'}) ={a', x'} and hence d;x’ ¢ E(H). 
Similarly, we have 


(11) For each i = 1, 2 and for every x € V(G,) — {a}, e;x' ¢ E(H). 


(12) a'bj, a'ci, b'a}, b'c, c'a,, c'b; ¢ E(H). 
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This follows from the fact that (V(G,))¢, (V(G,))@ and (V(G;))@ are closed 
sets of H. 


(13) a'c’ ¢ E(H). 


If a’'c'e E(H), then by letting xe V(G,)NN(a), we have a'x’e« E(H) and 
hence a’ € ({c’, x'}), contradicting the fact that a ¢ ({c, x}) = {c, x}. 


(14) For all x e V(G,) — {b} and y eV(G,) — {b,}, a'x’, ayy’, c'x', ciy'¢ 
E(B). 


Since (V(G;))@ is a closed set of H, the above claim is true. 
(15) For all x € V(G3) — {c} and y € V(G;) — {c;}, a’x', avy’ ¢ E(H). 


Suppose a’x’ € E(H). Then deg(x’) > 1 in H, which implies that deg(x) > 1 in 
G by Lemma 1(v). Let ueN(x)—{c}. Then u’x'€ E(H) and hence x'e 
({a', u'}), contradicting the fact that x ¢ ({a, u}) = {a, u}. Since (V(G;))¢ is a 
closed set of H, we have a; y' ¢ E(H). 

Similarly, we have 


(16) For all x € V(G,) — {a} and y € V(G,) — {a,}, c’x', cry’ ¢ E(H). 


(17) For all x € V(G,) — {a, a1}, ye V(G,) — {b, by} and z € V(G;) — {c, ¢;}, 
x'y', y'z', x'z' ¢ E(H). 


Suppose x'y’e E(H). Then deg(x’)>1, which implies that deg(x)>1. Let 
ue N(x)— {a}. Then u'x’ € E(H) by (1). Hence x’ e€ ({u', y’}), contradicting 
the fact that x ¢ ({u, y}). Similarly, y’z’, x'z' ¢ E(H). 

We thus conclude from the previous discussion that G = H. Observe that there 
are in (3) four different combinations for N(f;) and N(f3), which show that the 
bijection @ induced by ® need not be the identity mapping. Moreover, by 


Lemma 2, the assumption that £(G) 2 £(H) is not violated when each of these 
four combinations holds for N(f;) and N(f2). Therefore G is sensitive but not 
strongly sensitive. O 


Remark. Condition (ii) in Theorem 3 cannot be replaced by the following: 
(*) a,€ V(G,) — {a}, b,e V(G,)— {b}, and 
Cc, € V(G;3) = {c}. 


For instance, the graph G of Fig. 8, which satisfies condition (i) in Theorem 3 and 
condition (*) above, is not even maximally critical (Y(G) = ¥¢(G +e), where 
e =xy). 
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Fig. 8. 
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In the context of the combinatorial theory of ordinary differential equations recently 
introduced by the authors, a concrete interpretation is given to the classical method of 


separation of variables. This approach is then extended to more general equations and applied 
to systems of differential equations with forcing terms. 


1. Introduction 


Classically, the ordinary differential equation 


y'=dy/dt=f(t)g(y), yO)=a (1.1) 
is said to have ‘separable variables’ since we can write, in an informal manner, 
(see [4], pp. 12-20) 


dy/g(y)=f(t) at (1.2) 


and integrate on both sides to get 


a(y= | duigtu)= | f(a) de (1.3) 


If we now let @‘~'? denote the inverse function of ®, the solution of (1) can be 
simply written as 


f 
yit)= 0-(| F(x) dx). (1.4) 
0 
In this paper we establish the validity of this solution by giving a direct 
combinatorial interpretation of the function y = @‘~!(t) as the solution of the 
* Partially supported by grants NSERC A5660 (Canada) and FCAR EQ 1608 (Québec). 
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autonomous differential equation 


y=agly), yO)=a (1.5) 


and then by showing that the combinatorial solution of (1.1) admits a kind of 
‘separation of vertices’ which implies (1.4). Moreover, we show that this 
combinatorial separation of variables can be applied to more general differential 
equations, or systems of differential equations, of the form 


y= LH(OBAy) (1.6) 
where j varies over some finite index set J. Examples include the linear equation 

y'=f(Oy + 8(t) (1.7) 
and equations of control theory such as 

y'=goly) + u(t)gi(y) (1.8) 


where gp and g, are given and u(t) is a variable function. 

In particular we establish by purely combinatorial methods a functional 
expansion for the solution of (1.6), in terms of iterated integrals, which implies 
the so-called fundamental formula of Fliess for systems of differential equations 
with forcing terms ({1, 2]). 

As in our previous papers [8,9], we work in a combinatorial setting where 
functions are replaced by species of structure over linearily ordered sets 
(L-species). The relationship with analysis is obtained by taking generating 
functions of L-species, a process which preserves all the usual operations of 
calculus, including integration, when properly defined on L-species. The reader is 
referred to [8], where the combinatorial theory of differential equations has been 
initiated and which serves as a basis for the present paper, for background and 
terminology. 


2. Combinatorial separation of variables 


We first consider the example of the differential equation 


YeT <¥%, Y(0)=1 (2.1) 
whose analytical solution is given by 
y(t) = 1/V1-. (2.2) 


The use of capital letters in (2.1) indicates the fact that we want a 
combinatorial solution, in terms of 1-species. As shown in [8], there is a unique 


L-species Y = Ayy:(T) which satisfies (2.1): the equation can be rewritten in the 
integral form 


Y=1+ { "¥V3(X) dX (2.3) 
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= fa 
fa a 


Fig. 1. 


and interpreted as follows (see Fig. 1), using the standard combinatorial 
interpretation of the integral (see [8, §2] and Fig. 5]: 
* there is a unique A7,:-structure on the empty set, denoted by 0; 
* any non-empty A7y:-structure can be canonically identified with the compos- 
ite structure consisting of 
(a) the minimum element of the underlying set 
(b) a singleton point 
(c) an ordered triple (left, middle, right) of similar A ,y:-structures. 

One can then attach the singleton point and the ordered triple of Ayy)- 
structures to the minimum element (then called a fertile point) as shown in Fig. 1. 
Iterating this procedure we find that A,y:-structures essentially consist of certain 
increasing labelled rooted planar trees (with labels increasing from root to leaves) 
such as the one illustrated by Fig. 2, on the linearly ordered set [19}= 
thy exch Los 

Let Y=Ter(T) denote the 1-species of increasing ternary trees, that is the 
solution of the autonomous equation 


y=¥, YO=1. (2.4) 


UB i esstearrsrererreoe 


My 
"Hy 
toay 
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Fig. 3. 


They are similar to A;,:-structures but simpler since there are no singletons 
attached to the roots; an example is given in Fig. 3, 

For n=O, let ¢, be the number of increasing ternary trees with n internal 
vertices. It is easily seen that ¢, = (2n —1)t,_,, for n=1, and that t= 1. Thus 


t, = (2n — 1) ++» (3)(1) (2.5) 
and we obtain the generating function, 


Ter(t)= >, t,t"/n! 


n2=0 
1 


“Vina 26) 


by the binomial theorem (see also {8], example 5.5). 
A comparison of Figs 2, 3 and 4 shows that an A7y:-structure can be viewed as 
an incrasing ternary tree where vertices have been replaced by cells containing 
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(unordered) pairs of elements, these cells being linearly ordered according to 
their minimum elements. This corresponds to substitution of species: we have 


Ary(T) = Ter(=) (2.7) 


where, as usual, equality stands for isomorphism of species. 
Using (2.6), we conclude that the generating function for A7y: is 


Ary3(t) = Ter( 5) 


1 
Aaa (2.8) 
as expected, and also, by the binomial theorem, that 
2n —1)---(3)(1) Pt” 
Aryx(t) = y ener (2.9) 


Note that 477=fj XdX and that, more generally, a structure of species 
J F=J§ F(X) dX can be represented as in Fig. 5. 

We end this section with the basic classical result on the method of separation 
of variables which is surprisingly simple from a combinatorial point of view. It 
comes as a natural generalization of the previous example and of example 4.5.a of 
{8] (the homogeneous linear equation). 


Proposition 2.1. Let Y = Agy(T) be the solution of the autonomous differential 
equation 


Y'=G(Y), Y(0) =Z; (2.10) 
then the solution Y = Ar(ryqcv)(T) of the equation with separable variables 

Y'=F(T)G(Y), YO)=Z (2.11) 
is given by 


Arnow(T)= Aoon( {FOO 4X), (2.12) 
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Proof. Simply observe that the increasing F(T)G(Y)-enriched arborescences that 
occur as the canonical solution of (2.11) can be viewed as Agy)(J F)-structures. 
See Fig. 6. O 


3. Iterated integrals 
Iterated integrals such as 
T S R 
| F(S)| GiR)| H(X) dX dR dS (3.1) 
0 0 0 


occur in the expansion of product integrals (see [3]) and in the solution of certain 
systems of differential equations (see e.g. [5], chap. 14). The standard com- 
binatorial interpretation of the integral can be applied to interpret iterated 
integrals (3.1) as L-species. One gets enriched increasing arborescences as in Fig. 
7s 

More generally, given a family F = {F;},., of L-species, where J is some finite 
set, and given any word a €J*, there is an induced iterated integral, denoted by 
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Fig. 8. 


2 F da, which is defined inductively as follows: 
If w=, the empty word, then 


[Fae = (3.2) 


the ‘empty set’ species; 
If a =jB, i.e. j €J is the first letter of the word a, then 


[Facey [goof Fapax. (3.3) 


An example of an {j Fda-structure is given in Fig. 9 (see below), with a= 
12112212 2. It is readily seen that an [§ Fda-structure is an assembly of 
(Lies Jo F(X) dX)-structures aligned in the natural order of their smallest 
elements. Hence we can state: 


Proposition 3.1. For any finite family {F;};-; of l-species, we have 


». [ sea = ex(> 2 Ss F(X) dx), (3.4) 


m@=!21122122 


Fig. 9. /(T)-structure. 
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4. A generalized combinatorial separation of variables 


The technique of combinatorial separation of variables can also be applied to 
equations, or systems of equations, of the form 


¥'=DS K(T)G(¥), (0) =Z (4.1) 


where j varies over some finite index set J. We will first deal with one simple 
equation: 


Y'=F(T)G(Y)+F(T)G(Y), Y¥(0)=Z (4.2) 


with J = {1, 2}, and later give appropriate generalizations as well as examples, in 
Section 5. 

Setting M=M(T, Y)=F(T)G,(Y)+ F(T)G,(Y), the canonical solution of 
(4.2) is the L-species Y=Ay=A,(T, Z) of so-called M-enriched increasing 
arborescences with buds (buds constitute an extra sort of elements, corresponding 
to the variable Z; see §3 of [8]). A typical A,,-structure is represented by Fig. 8 
where it is seen that the M-enrichment at each fertile point consists of either an 
F,(T)G,(Y)- or an F(T)G,(Y)-structure. 

Now combinatorial separation of variables can be applied as follows: associate 
with each M-enriched increasing arborescence a pair of structures as illustrated by 
Figs 9 and 10, for the arborescence of Fig. 8, thus defining two species /(T) and 
H(Z). 

The /(T)-structure is in fact an iterated integral [J F da-structure obtained by 
extracting the f{ F\- and J F-structures of the arborescence and aligning them in 
their natural order. Moreover, we take note of this relative order in what remains 
of the arborescence to obtain a structure on buds (see Fig. 10). This is the 
H(Z)-structure. 

It can be seen that the H(Z)-structure comes from the application to Z of a 
sequence of operators of the type 2, or 2, according to the same word a@ as in 
the iterated integral; the 2,’s are ‘eclosion operators’ (terminology due to G. 
Labelle; see [6, 7] and [8, §5]) defined, for j eJ, by 2, = G,(Z) 3/aZ (Fig. 11). 


m=121122122 


Fig. 10. H(Z)-structure. 
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2 WA 


Fig. 11. 


For instance the H(Z)-structure of Fig. 10 is a typical structure of the species 
(Bye Boe Di» By» Boe D, 0 Bio Bo B,)(Z). (4.3) 


Note that we compose operators from right to left so that the order of a@ is 
actually reversed. We introduce the notation 2, for the operators defined 
inductively by 

D.=Id, — Drip) = De B. (4.4) 

To summarize, combinatorial separation of variables associates to any M- 

enriched increasing arborescence, a word a@ and a pair of structures of species 


Jj Fda and 2,(Z) respectively. This correspondence is clearly reversible and 
bijective and also valid for an index set J of any finite cardinality. Hence we have: 


Proposition 4.1. Let Y = A,(T, Z) be the solution of the differential equation 
Ya DANG), YO)=z (4.5) 
where {F;};<, and {G;},<, are given finite families of |-species. Then we have 
AulT, 2)= & ( } 'F da\(9,(2)) (4.6) 
and, more generally, for any |-species ®, 


(47, Z)= > (f 'Fda)(Dq( (2). (4.7) 


aeJ* 


Proof. Only (4.7) remains to be proved. For any word a, the expression 
Ji Fda QD, can be interpreted as an operator on [-species; it should be clear that 
when this operator is applied to any ®(Z)-structure, the result is a P(A,,(T, Z))- 
structure. This induces the desired isomorphism. O 


Remark. We now show that formula (4.7) is also valid in the case of a system of 
differential equations of the form (4.5) where 


* Y is a vector of species Y=(Y,..., Yn) 
* each F is an lL -species as before, 
¢ each G, is a vector (G,,,..., G;,,) of species of n sorts 


¢ Zisa vector (Z,,..., Z,) of variables that correspond to n sorts of buds. 
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The system (4.5) can then be written as 
Yj=D K(T)GAN,-.-,¥%,), YAO)=Z, i=1,...,0 (48) 
jes 

As observed in [8, §6], the combinatorial solution of a system of equations is 
similar to that of a single equation. It consists, in the present case, of enriched 
increasing arborescences as in Fig. 8, with the following additional features: 

A ‘color’, i.e. an element of the set {1,...,#}, is attributed to each fertile 
point and the structure attached to a fertile point of color i is required to be a 
F(T)G,{Y,,.-., Y,)-structure for some jeJ. The ith component Y;= 
Aw, (T, Z) of the solution Y = A,,(T, Z) consistes of structures which are either a 
single bud of sort i or such colored arborescences whose root has color i. 

For each j € J, the ‘eclosion’ operator ; is defined by 

.,Z,)o 


Q, = > G, (Z,, oe 


aZ, (4.9) 


it applies to species ®(Z)= O(Z,,..., Z,) of nm sorts of buds as follows: a 
2, P(Z))-structure is a @(Z)-structure in which, for some color i, a bud of sort i 
has been replaced by a G, (Z,,... , Z,)-structure. 

With this interpretation, formula (4.7) is then again valid. This constitute a 
combinatorial proof of the ‘fundamental formula’ of Michel Fliess (see [1, 2]) for 
systems of differential equations with forcing terms, that is systems of the form 
(4.8), with J = {0, 1,..., m}, Fo(T) =1 and, for j= 1, K(T) = U,(T) is a variable 
‘input’ function. 


5. Examples 


The linear differential equation 
Y’=F(T)Y + G(T), Y(0)=Z (5.1) 


was studied from a combinatorial point of view in [8] and the classical formulas 
proven. However this is a special case of (4.5), with J= {1,2}, A(T)=F(T), 
F(T)=G(T), G,(Y)= Y, and G,(Y)=1, and the application of (4.6) gives an 
alternate formula for the solution of (5.1) which is closer to the simple 
combinatorial solution. Indeed, in this case, we have 9, = Z9/9Z and B, = 9/dZ 
and the only words a € J* for which the operator 2, applied to Z yields a non 
zero result are 


11---1=1* and 11---12=1*2, k20. (5.2) 
For these words, we have 


Dix(Z) =Z and Dye2)(Z) =]. (5.3) 
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Hence we get the following expression (to be compared with formula (4.16) of 
[8]) for the solution of (5.1). 


Proposition 5.1. The solution Y=A,(T, Z) of the linear differential equation 
(5.1) can be expressed as 


Ar(T, Z)= Pa [e G)d(1*)Z te fe. G) d(1*2) (5.4) 


where, for a € {1, 2}*, S32 (F, G) da denotes the iterated integral associated with a 
and the family (F,, F,) = (F, G). 


We now give an application to differential equation with forced entry of the 
form 
Y'=G(Y)+ U(T), Y(0)=0 (5.5) 


and in particular to the equation 
Y’=aY+bY*+U(T), ¥(0)=0 (5.6) 


that models an electric circuit with a quadratic resistance. U(T) represents a 
variable ‘entry’ or ‘control’ function (the current, in (5.6)), and Y(T), the output 
(voltage) function. These equations are of the form (4.5), with J= (0, 1}, 
F(T) =1, K(T)= U(T), Go(Y)=G(Y) and G,(Y)=1, thus Do= G(Z) 9/dZ 
and 2, = 0/9Z. By proposition 4.1, the solution Y=V(T) of the differential 
equation (5.5) can be expressed as 


T 

V(T)= > | (1, U) da 2,(Z) |z~-0. (5.7) 
awe{0,1}* “0 

The evaluation at Z = 0 in (5.7) comes from the initial condition V (0) =0. 

We see that V(T) depends not only on T but also on the variable function 
U(T), i.e. V = V(T, U). Thus (5.7) is a functional expansion of V(T, U) in terms 
of iterated integrals [f (1, U) da, for words a € {0, 1}* and the family (/, i) = 
(1, U). For the associated generating series, we get an analogous functional 
expansion 


t 
viiUuj= d+ VY [ (1, U) da (5.8) 
ae {0,1}* 0 

which we call the Fliess series of the system 5.5. Much attention is devoted in 

control theory to the numerical computation of the coefficients V, in the Fliess 

series. In particular, Fliess and his school (see [1, 2]) transform (5.5) into an 

algebraic equation satisfied by the formal power generating series of {V.}aes- 
se SS OVX, (5.9) 


ae {0,1}* 


in non commuting variables Xj and X, (the monomials X, are defined recursively 
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weight = ab? 


a=00!000101t01 


Fig. 12. 


by X,=1 and X;g= X;Xg). The equation is of the form S=f(S) and can be 
solved iteratively as a fixed point problem using a programmed computer. 

The approach proposed here can greatly simplify the computation (see [2]) of 
the coefficients V, since it provides a direct combinatorial interpretation for them. 
Consider, for example, the equatoin (5.6), where a and b are formal parameters. 
The combinatorial solution Y= V(T) consists of increasing arborescences with 
the following properties (see Fig. 12): 

¢ No buds appear, since, by the initial condition, V(0) =0. 

« There are two types (0 or 1) of fertile points. Fertile points of type 0 have 1 
or 2 sons (left, right) and carry a weight, a or b, accordingly; these sons are 
themselves fertile. Those of type 1 are simply the roots of f U-structures; they 
carry no weight. 

¢ The global weight of the structure is the product of the weights of the fertile 
points of type 0. 

The word a € {0, 1}* is obtained by reading successively the types of the fertile 
points in their natural order. Combinatorial separation of variables then gives, on 
the one hand, a [27 (1, U)da-structure (Fig. 13) and, on the other hand, a 
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b 
Xe va weight = a°b? 


1% a = 00100010101 


Fig. 14. 


so-called ‘weighted increasing 1-2 planar tree’ on k = |a| phantom buds (Fig. 14). 
The latter has labels corresponding to the order of enclosures, which can be of 
two kinds: 


(aZ + bZ)d 
Y= 
OZ 
The word a and the weight a‘b/ are the same as those of the arborescence and 
can be read on the associated tree. 


For a € {0, 1}*, let P(a) be the set of all such 1-2 trees over a and let P,(a, b) 
be its generating polynomial: 
P,(a,b)= > weight(p). (5.11) 


peP(a) 


a 
and 9,=—. (5.10) 


As formula (5.7) shows, P(a) is the set of empty (i.e. no buds) 9,(Z)-structures, 
that is 
P(a@) = D,(Z)|z=0- (5.12) 


In terms of generating functions, we obtain the following: 


Corollary 5.2. The coefficients V, in the Fliess series (5.8) for the solution 
Y=V(T) of the equation Y'=aY+bY*+U(T), Y(0)=0 are the generating 
polynomials of 1-2 trees over a: 


V, = P,(a, b). (5.13) 
In conclusion, we note that the polynomials P,(a, 6) can be given a more 


explicit form using standard combinatorial techniques such as Motzkin words and 
paths and Dyck words (see [10]). 
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A well-known theorem of Heawood states that 3-edge-coloring bridgeless planar cubic 
graphs—and, hence, the four-color theorem—is equivalent to labeling vertices with either +1 
or —1 so that the sum around any face is 0 (mod 3). In this paper we introduce the notion of 
“angle-labeling” and give results analogous to Heawood’s for bridgeless planar graphs with 
vertices of degree 2 or 3. 


1. Introduction 


The four-color theorem has had a long and fruitful history. The fecundity of the 
four-color problem has been splendidly documented in the award-winning paper 
“Thirteen Colorful Variations on Guthrie’s Four-Color Conjecture” by Saaty 
which appeared in 1972 [4], and which was the basis for a later book [5]. 

Two of the variations which are of interest to us in the present paper are the 
edge-colorings of P.G. Tait and the vertex-labelings of P.J. Heawood. Tait 
initiated the study of edge-coloring in 1880 by translating the problem of coloring 
maps to the problem of coloring the edges of cubic graphs [6]. In so doing he 
mistakenly believed that he had proved the four-color theorem. The following 
theorem is proved in [1, pp. 26-27]. 


Theorem 1 (Tait). The four-color theorem is equivalent to the statement that 
every bridgeless planar cubic graph is 3-edge-colorable. 


Since the four-color theorem is now known to be true, it follows that every 
bridgeless planar cubic graph is 3-edge-colorable. We offer the following 
conjecture which extends this result. 


Conjecture. Let G be a bridgeless planar graph each of whose vertices is of 
degree 2 or 3 and having at least two vertices of degree 2. Then G is 
3-edge-colorable. 


(We note that such a graph having exactly one vertex of degree 2 is not 
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3-edge-colorable, for the same reason that a cubic graph that has a bridge is not 
3-edge-colorable. ) 

The main goal of this paper is to present a method of attack on the problem of 
proving this conjecture. Our approach is to mimic the way in which Heawood 
translated the problem of coloring the edges of a graph to the problem of solving 
a system of congruences arising from the labeling of the vertices [2]. 


Theorem 2 (Heawood). A bridgeless planar cubic graph is 3-edge-colorable if 
and only if its vertices can be labeled either 1 or 2 in such a way that the sum of the 
labels around any face is divisible by 3. 


2. Angle-labeling of cubic graphs 


We begin our discussion of angle-labeling of cubic graphs by stating a result 
from [3] in which an ‘‘angle-labeling” of a graph means that the “‘angles” of the 
graph are labeled either 0 or 1. (Since an angle can be defined formally as a pair 
of adjacent edges, we need not require the graph to be planar.) With each vertex 
of the graph we associate six angles, three “interior” angles and three ‘“‘exterior’’ 
angles, as shown in Fig. 1. We abuse notation slightly and allow a; to represent 
both the angle and its label. 


Fig. 1. 


Theorem 3 (Loupekine and Watkins). A cubic graph is 3-edge-colorable if and 
only if there is an angle-labeling of the graph with 0’s and 1’s such that at each 
vertex the six associated angle-labels satisfy the congruence 


Qa, + a, + a3 —@4—a5— a, =2 (mod 4). 


We now state a result for planar cubic graphs in which we label each angle of 
the graph either 1 or 2 and, with each edge of the graph, we associated four 
angles, as shown in Fig. 2. This result follows immediately from Theorem 6 in the 
next section. 

at a2 


Fig. 2. 


Theorem 4. A bridgeless planar cubic graph G is 3-edge-colorable if and only if 
there is an angle-labeling of the graph with 1’s and 2’s such that 
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(a) at each edge the four associated angle-labels satisfy 
a,+a,+a,+a,=0 (mod 2); 


(b) the sum of the angle-labels interior to any face of G is 0 (mod 3). 


3. Angle-labeling of graphs having vertices of degree 2 and 3 


We now turn to graphs that have vertices of degree 2 as well as vertices of 
degree 3. 


Theorem 5. Let G be a bridgeless planar graph each of whose vertices is of degree 
2 or 3. Then, G is 3-edge-colorable if and only if there is an angle-labeling of the 
graph with 1’s and 2’s such that 

(a) the sum of the angle-labels interior to any face is 0 (mod 3); 

(b) the sum of the angle-labels around any vertex is 0 (mod 3). 


Proof. First we assume that the angles of G have been labeled with 1’s and 2’s so 
that conditions (a) and (b) hold. We color the edges of G with colors 0, 1 and 2 as 
follows. Begin by coloring some edge with color 0. Then, for an edge e, adjacent 
to some previously colored edge e,, we color e2 with the sum (mod 3) of the color 
of e, and the label (or labels) of the angle (or angles) to the right of the path e,e>. 
We then repreat this process, always coloring a new edge that is adjacent to an 
edge that has been colored previously. 

We show that this procedure leads to a 3-edge-coloring of G. Let vu be the 
vertex incident to each of e, and e,. If v has degree 2 then, by condition (b), one 
angle at v has label 1 and the other angle at v has label 2; thus, whichever label is 
to the right in going from e, to e2, we get a color for e, that differs from the color 
for e,. If v has degree 3, then by condition (b) all the angles at v have the same 
label; thus, whether there is one angle or two angles to the right in going from e, 
to e,, we get a color for e, that differs from the color for e,. Furthermore, it 
follows from condition (a) that the color assigned to an edge is independent of the 
path used to reach that edge. Thus, G is 3-edge-colorable. 

In order to prove the converse, we assume that the edges of G have been 
3-colored with colors 0, 1 and 2. Then to label the angle between two edges—say 
e, and e,, colored respectively c, and c,, with e, counterclockwise from e,—we 
label the angle with c, — c, (mod 3). This labeling certainly satisfies condition (a) 
since, if the edges clockwise around any face are colored c,,c,....,cC,, then 
the sum of the angles around this face is (c, — c,) + (¢3 — €2) +--+ - +(e, —G,) =O. 
This labeling also satisfies condition (b): at a vertex of degree 2, the two adjacent 
edges are colored c, and c,, say, and one angle is labeled c,-c, and the other 
C1 — C2, so one label is 1 and the other label is 2; at a vertex of degree 3, the three 
edges are colored, in counterclockwise order, either 0, 1, 2 or 0, 2, 1; in the first 
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case the three angles are all labeled 1, and in the second case the three angles are 
all labeled 2. This completes the proof of the theorem. O 


In the preceding argument we always gave the same label to all three angles 
around a vertex of degree 3; this leads to the following restatement of Theorem 5. 


Corollary. Let G be a bridgeless planar graph each of whose vertices is of degree 2 
or 3. Then, G is 3-edge-colorable if and only if there is a labeling with 1’s and 2’s 
of the vertices of degree 3 and of the angles at vertices of degree 2 in which one 
angle is labeled 1 and the other angle is labeled 2, such that 


around any face the sum of the vertex-labels (for 
vertices of degree 3) and the interior angle-labels 
(for vertices of degree 2) is 0(mod’3). 


We can now give the generalization of Theorem 4 for graphs having vertices of 
degree 2 or 3. This characterization is in terms of the four angles associated with 
an edge (see Fig. 2). 


Theorem 6. Let G be a bridgeless planar graph each of whose vertices is of degree 
2 or 3. Then, G is 3-edge-colorable if and only if there is an angle-labeling of the 
graph with 1°s and 2’s such that 
(a) the sum of the angle-lables interior to any face is 0 (mod 3); 
(b) for each edge e joining two vertices v and w the four angle-labels associated 
with e satisfy 


a,+a,+a,+a,= deg(v) + deg(w) (mod 2). 


Proof. First we assume that G is 3-edge-colorable. Then, by Theorem 5S, there is 
an angle-labeling such that condition (a) holds and also the sum of the 
angle-labels around any vertex is 0 (mod 3). If a vertex has degree 3, therefore, 
all three angles have the same label, whereas, if a vertex has degree 2, then one 
angle is labeled 1 and the other 2. Thus, if both v and w have degree 3, then the 
sum of the four angles at an edge is 0 (mod 2); if, say, v has degree 3 and w has 
degree 2, then the angle sum is odd, if both uv and w have degree 2, then the angle 
sum is 6. So, condition (b) holds as well. 

Conversely, suppose that the angles of G have been labeled so that conditions 
(a) and (b) hold. We show that G has a 3-edge-coloring. For each edge e, let v be 
the vertex at one end of e and let a, and a, be the associated angle-lables at that 
end. Then we label e with a, + a,+deg(v) +1! (mod 2). (Because of condition 
(b), we get the same label for e if we use the other end and label e with 
a, + a; + deg(w) + 1 (mod 2).) This labeling of the edges partitions the edges of G 
into two classes, those labeled 0 and those labeled 1. We claim that the edges 
labeled 1 form a disjoint set of cycles. 
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Let uv be a vertex of degree 3 and let a,, a2, a; be the three angles at v. Then 
the sum of the labels for the three edges incident to v is 2a, +2a,+2a;+ 
3 deg(v) + 3=0 (mod 2). Thus, at a vertex of degree 3, either all three edges are 
labeled 0 or two edges are labeled 1 and one edge is labeled 0. Now let v be a 
vertex of degree 2, and let a, and a, be the two angles at v. Then the sum of the 
labels for the two edges incident to v is 2a, + 2a,+2deg(v) +2=0 (mod 2). 
Thus, at a vertex of degree 2, both edges have the same label. It follows that the 
edges labeled 1 form a disjoint set of cycles in G. 

Thus, by the Jordan Curve theorem, we can color the regions between these 
cycles black or white so that these cycles form the boundary between regions 
colored black and regions colored white. If an angle is in a white region, then we 
do not change its label. If an angle is in a black region, then replace its label a by 
3 —a. Condition (a) still holds. Because of the way in which edges were labeled, 
an edge e incident to a vertex of degree 3 is labeled 1 if a, #a,, and is labeled 0 if 
a,=a,. Therefore, if all three edges incident to such a vertex are labeled 0, then 
all three angles have the same label, and, since such a vertex is an interior point 
to a white or black region, this remains true after our color switch. On the other 
hand, if two of the three edges are labeled 1, then the label of the angle between 
these edges differs from the label of the other two angles. However, after the 
color switch, all three have the same label. Thus, we can assign a label 1 or 2 to 
each vertex of degree 3—namely, the label of the surrounding angles. 

Now, condition (a) is simply the condition given in the corollary to Theorem 5, 
so by that corollary, G is 3-edge-colorable. O 


We conclude with a result that combines angle and edge labeling. As labels, we 
use elements of the symmetric group 


S; = {a, b:a°=b’ =1, ab = ba’}, 


in which 1 is the identity element. We label angles with either a or a’ and we 
label the “‘sides” of edges with either 1 or b. Thus, we associate with each edge 
four angle-labels and two edge-labels; with each face, we associate an angle-label 
at each vertex on the boundary and an edge-label for each edge on the boundary. 


Theorem 7. Let G be a bridgeless planar graph each of whose vertices is of degree 
2 or 3. Then, G is 3-edge-colorable if and only if there is a labeling of G with 
elements from S; in which the angles of G are labeled a and a” and the sides of 
edges are labeled 1 and b such that 
(a) the product in S, about any face of all the interior angle and edge labels 
(taken in order) is the identity 1; 
(b) for each edge e the total number of a’s and b’s is congruent to the sum 
(mod 2) of the degrees of the endpoints of e. (Note that a* counts as two 
a’s.) 
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Proof. First we suppose that G can be 3-edge-colored. Label both sides of each 
edge with the identity 1. Then label the angles with 1’s and 2’s so that the two 
conditions of Theorem 6 hold. If at each angle we now replace its label x with a’, 
then conditions (a) and (b) are immediately satisfied. 

Conversely, assume that G has been labeled so that condition (a) and (b) hold. 
The idea is to reverse the above procedure, but first any b’s that occur as labels 
have to be eliminated. By condition (a) the number of b’s around any face is 
even. We can eliminate the b’s one pair at a time by replacing the label b with the 
identity 1 and ‘sliding’ the b counter-clockwise around the interior of the 
boundary of the face, changing any angle-label we pass from a to a” (or from a” 
to a), until the b reaches another edge labeled b, at which point we replace the b* 
with the identity 1. Since ab = ba” (and a*b = ba), this maintains condition (a) at 
each stage and, since an edge losing or gaining one b also loses or gains ana, the 
total number of a’s and b’s does not change (mod 2). In this way we remove all 
b’s. Now, at each angle, replace the label a* with x. Conditions (a) and (b) of 
Theorem 6 now hold, so G can be 3-edge-colored. O 
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A study of the orders of maximal induced trees in a random graph G, with small edge 
probability p is given. In particular, it is shown that the giant component of almost every G,, 
where p =c/n and c >1 is a constant, contains only very small maximal trees (that are of a 
specific type) and very large maximal trees. The presented results provide an elementary proof 
of a conjecture from [3] that was confirmed recently in [4] and [5]. 


1. Introduction 


Denote by 4(n, p) the set of all graphs with vertex set V = {1,2,...,} in 
which the edges are chosen independently and with probability p. In other words, 
if G is a graph with vertex set V and it has m edges, then 


P({G}) =p™(1— pyO-™. 
Let G, stand for a random graph from &(n, p). We say that almost every (a.e.) 
graph in Gn, p) has a certain property Q if P(Q)— 1 asn—>~. 

What is the probability distribution for the orders of induced subgraphs in a.e. 
graph G,? A lot of interesting results devoted to this fundamental question in 
random graph theory one can find in [9] where the case p-constant was examined 
in great detail. 

In this paper we shall be assuming that the edge probability p depends on n and 
p(n)—0 as n—~. Under this assumption we focus our attention on the orders of 
maximal induced trees that are contained in a.e. graph G,. A tree that is not 
properly contained in any larger tree is called a maximal tree. Since we deal here 
only with induced trees therefore the word ‘induced’ will be very often omitted. 

The symbols o and O are with respect to nm unless stated otherwise. All 
logarithms are natural. Also |x| means an integer part of x. Finally, throughout 
the paper y(n) stands for a sequence tending to infinity (usually arbitrarily slowly) 


asn—o, 


2. Maximal induced trees 


Given a random graph G,, what numbers are likely and are not likely to occur 
as orders of maximal trees in G,? This section is devoted to an examination of 
this question. 
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We say that a graph has property 9; if it contains a maximal tree of order k. 
Let us begin our considerations with the following result. 


Theorem 1 ((3]). Assume that 0<p <1 and ¢>0 are fixed. Then a.e. graph G, 
has property J, for every integer k satisfying 


logn logn 
1+ <k<(2- 
( Sond ( ond 
where d=1/q. 
. : logn : 
Notice also that a.e. G, is such that for every set S of s<(1-«) fogd vertices 
there is a vertex v ¢ § joined to precisely one vertex in §. Thus, in particular, 
| 
every tree of order s<(1-«) oe is contained in a tree of order 5 +1. 


Furthermore (see e.g. [3]) almost no G, contains a tree on more than (2 + €) 
logn 
log d 

The above situation becomes more involved in a case when p = p(n)—0 as 
n—x. Assume first that 0<p<c/n where 0<c<1 is a constant. Then a 
random graph G, is very sparse and contains many components which are trees. 
In fact, suppose p = c/n, 0<c<1, and y(n)—~«. Then a.e. G, is such that every 
component is a tree or a unicyclic graph and there are at most y(n) vertices on 
the unicyclic components (see e.g. [2, p. 99}]). Obviously every isolated tree is 
itself a maximal tree and every unicyclic component contains at least three 
maximal trees. A wide variety of results devoted to possible orders of isolated 
trees in such a random graph the reader can find in Bollobas [2, Ch. V. 3]. As an 
example let us quote ((2, p. 106 and 108]) the following fact. (We say that a graph 
has property ¥, if it contains a tree component on k vertices). 


vertices. 


Theorem 2. Assume that p = 1/n. 
(i) If ko/n*?—> «=, then almost no G, has J, for every k = ko. 
(ii) If ky/n?° 0, then a.e. G, has J, for some k >= kg. 
(iii) If ko/n?’ 0, then a.e. G, has $, for every k <kp. 


Now let us turn our attention to a case when a.e. graph G, is the union of one 
giant component, small unicyclic components and small tree components. 
Namely, we shall be assuming that p = c/n where c> 1 is a constant. Denote by 
L(G,) the order of the ith largest component of G,. Then (see e.g. [2, p. 138}) 
we know that the order of the giant component of a.e. G,, satisfies 


ILi(Gp) ~ (1 — t(c))n| < y(n)n? 
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where y(n)— © and 


bee! 
Ke) == dp (ce) (1) 
Moreover, for every fixed i =2 
L(G,) - : (log n — 3 log logn)| < y(n) (2) 
where 
a=c-—1-loge. (3) 


Our first result specifies those numbers that are not likely to occur as orders of 
maximal trees in the giant component. Let 


h(c, ~) =(1—-a@)'""%c exp| - ?- (a- ace. (4) 


It can be checked that, for a given c, there is a= a,(c) such that the function 
h(c, &) is increasing for 0< @ <a and is decreasing for a< a<1. Moreover 
h(c, @) > 1. Let a(c) be smaller positive root of the equation 


h(c,a)=1, O0<a<l. (5) 


The following result is true. 


Theorem 3. Let p =c/n where c>1 is a constant and y(n)—>~. Then the giant 
component of a.e. graph G, contains no maximal trees of order k, where 


y(n) <k<a(c)n. (6) 


Proof. Let X, be the number of all trees on k vertices in G,. Then the expected 
value of X; is given by 


E(X;,) = (Tet — p)@-E-D), 7) 


Furthermore, let Y, stand for the number of those trees of order k which are 
maximal in G, and, in addition, for every such tree there is a vertex in G, 
connected with at least two of its vertices. It is clear that every maximal tree 
contained in the giant component is of this kind. Moreover 


k 
E(¥) <E(X,)(n ~ ky (5 )p7tt ~ kp(1 - py 


In order to get a good upper bound for E(Y,) we use the following consequence 
of Stirling’s formula: if y(”) is a sequence tending to infinity as n — , then for all 
large n and all @ such that am is integer and y(n) < an <n — y(n) it is the case 
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that 


n 1 a = l-ay-n 
(on) <meta a ee a (8) 


Consequently, making the change of variable k = an and applying the inequality 
1—x <e “ together with the formula 


1—x=exp[-x-5 + 0(r)| (9) 
we find that, for all large n, 


E(%) = 0 = {a¢c, "| 


where h(c, @) is given by (4). Now, for all a such that y(n)/n < a < a(c) we have 
h(c, w)<1 and consequently the summation of E(Y,) over all k satisfying (6) 
tends to zero as n> ~, This implies the thesis of the theorem. O 


Remark. The above proof gives also some additional information about the 
orders of small components of G,. Namely, for p=c/n, c>1, the unicyclic 
components that appear in a.e. G, are of fixed orders only. 

The last theorem suggests that the giant component of a random graph 
G.,(¢c >1) can contain only some maximal trees of small orders and some very 
large maximal trees. It appears that the small maximal trees that can be contained 
in the giant component are of a very special structure. We introduce the following 
definition. Let T be a tree in a graph G and N(T) be the set of all neighbors of T. 
Assume that |N(T)| =1 and, in addition, that the vertex from N(T), which is 
called here a horn, is connected with exactly two vertices from T. Then T U N(T) 
will be called a horn-tree. We have the following theorem. 


Theorem 4. Let p = c/n where c>1 is a constant. Then the only maximal trees of 
order less than y(n) which can appear in the giant component of G,, are horn-trees. 


Proof. The probability that G, contains a subset on k vertices, where 4<k < 
y(n) and y(n)— = arbitrarily slowly as n—> =, that spans more than k edges is at 


most 
y(") n (3) (5) ‘ 
> (i) > | 27 ta py" 
k=4 f=k+1 


Clearly, this implies our thesis. UO 
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Observe that every horn-tree contained in the giant component of G, is a cycle 
U,U2-+-U,v, where vj, 1<j<i-—1, is the root of an isolated tree in G, \ 
{U,,..., Uj—1, Uj+1,--+, U;} and v; is the horn which belongs to the giant 
component. In other words these ‘trees’ hang on to the giant component by their 
horns. This, together with the last two theorems, implies easily the following 
result. 


Theorem 5. Assume p =c/n, where c>1 is a constant. Then the giant component 
of a.e. graph G, contains a tree on at least |a(c)n| +1 vertices where a(c) is 
defined by (5). 


Proof. Having in mind Theorems 3 and 4 it suffices to show that the number of 
all horn-trees contained in the giant component is finite. In fact, if this is the case, 
then picking up an edge that is not in a horn-tree (a lot of such edges will be 
available) one can surely extend this edge to a tree on at least |a(c)n| +1 
vertices. First we determine the expectation of a random variable H, defining the 
number of all horn-trees of order <y(n) that are in the giant component. Let HT 
be such a horn-tree and denote its horn by h. Then HT \ {A} must be an isolated 
tree in G,\{h} and the vertex A must belong to the giant component of 
G, \(HT \ {h}). These two events are independent and the probability of the last 
one, say r,x, tends to 1—¢(c) as n—»~, where ¢(c) is defined by (1) (see [2, 
p. 138}). Consequently, with E(X;) given by (7), 


E(H) = s E(X,)(n - ‘(5 )"a — p)Krrk-DHE=Dp 


k=2 2 
c 7 @ (ce~“)* Par 
25 (1 t(c)) ik =r : 


The last series converges, so denote its sum by A(c). Standard arguments may 
now be used to show that H has asymptotically Poisson distribution with 
parameter A(c) (see e.g. [2]). O 


The above considerations give a new proof of a conjecture from [3] that a.e. G, 
with p=c/n, c>1, has an induced tree on @(c)n vertices where g(c)>0 is a 
function on c. This conjecture was confirmed independently and contem- 
poraneously by Fernandez de la Vega [4] and Frieze and Jackson [5]. In both 
papers the authors applied sophisticated algorithms constructing an induced tree 
in G, and gave their probabilistic analyses which are not straightforward 
calculations. The advantage of our approach is the much simpler way to succeed 
in proving the conjecture. However, in a case when c>1 is small our result is 
weaker than the ones proved in [4] and [5]. For example, it was shown in [4] that 
a.e. G, contains a tree on at least |B(c)n| vertices where B(c) is the positive root 
of the equation 


cB = log(1 + c2B). (10) 
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One can deduce from this that for p = 2.45/n the order T,(p) of the largest tree 
contained in G, is at least |0.6476n]. This is the best lower bound of T,(p) 
obtained until this time. On the other hand, taking into account the number of 
trees on k = [dn] vertices, by (7), (8) and (9) one can check that 


(i- oT} 


= -3 
EAL) = O(n | exp(6’c/2) 
=o(1) 


provided c=2.45 and 520.92. Consequently we see that for ae. G, with 
p=2.45/n 


0.6476n < T,(p) <0.92n. 


Observe also that for large values of c both a(c) and f(c) are equal to 
log c/c(1 +0,(1)) where o,(1) is a constant arbitrarily small for c large enough 
(compare this with Theorem 7). Let us remark here that the results about large 
induced trees were strengthened by Frieze and Jackson [6] by specifying that the 
induced tree can be taken to be a path. 


The next result deals with the orders of all possible maximal trees in the whole 
random graph G,,,. 


Theorem 6. Let p =c/n where c >1 is a constant, a be given by (3) and y(n) >. 
(i) A.e. G, has property 7, for every k satisfying 


1 
1<k< 7 (log n —3loglogn - ¥(n))| 


and for some k = B(c)n where B(c)>0 is the root of equation (10). 
(ii) Almost no G, has property J, for every k satisfying 


; (log n — 3 loglogn + y(n)) <k <a(c)n 
where a(c) is given by (5). 


Proof. (i) Denote by Z, the number of isolated trees of order k in a random 
graph G,. Then (see e.g. [1]) 
E(Z,) ~ nk*~2ck-'e-*"/k! 
and 
Var(Z,) ~ E(Z,){1 + (c — 1)(kc)*'e-**/k!}. 
Suppose 


1 
1<k<ky= [5 (log — 3 loglogn — y(n)) |. 
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Then clearly 


n 
Z.)~ : 1l—c)\k 
E( k) oo ly ki (ce ) 
and by Chebyshev’s inequality we obtain 


P(Z, =0) “Ezy + o(). 


Consequently 


el 
P(Z, =0 for some k < ky) = o|- > Kiet} 
k=1 


= of > Kieren} 
Nn i=o 

= O(e~*) 

= 0(1). 


Thus a.e. G, has an isolated tree (which is obviously a maximal tree) of order k 
for every k <kpo. 

The second part of (i) follows from Fernandez de la Vega result [4]. 

(ii) This is implied by (2) and Theorem 3. O 


Finally, let us assume that p = w(n)/n = o(1) where w(n) is a sequence tending 
to infinity as n—~. Then (see e.g. (2, p.137]) a.e. G, is such that every 
component of G,, with the exception of its giant component, is a tree of order 
o(logn). Clearly, the number of tree components is a monotone decreasing 
function when w(n) is increasing. Eventually, when w(n)=logn + y(n) where 
y(n) the last isolated vertices disappear and a.e. graph G, becomes 
connected (see e.g. [2, p. 150]). In our next result we examine orders of maximal 
trees in the giant component of G, with p=w(n)/n. Of course, if w(n)> 
log n + y(n) then by the ‘giant component’ we mean the whole graph G,. For the 
sake of simplicity let us put 


log w(n) 


iO== a 


(11) 
We have 


Theorem 7. Let p = w(n)/n = 0(1) and e >0 be a constant. Then a.e. G, is such 
that every maximal tree in its giant component has the order k, where 


(1 - e)nf(n)<k<(2+)nf(n). 


Proof. The fact that there is no maximal tree on k vertices for every k, where 
1<k<(1—«)nf(n) follows the same lines as the proof of Theorem 3. On the 
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other hand, by (7) we have 


= Ol pana (Sa) } 


=o(1) 


provided k = |(2+ €)nf(n)|. This implies that there is no tree of order greater 
than (2 + €)nf(n) and the proof is completed. O 


Having in mind the global structure of a random graph G, and Theorem 7 we 
can deduce immediately the following result about 7;,(p), the order of the largest 
tree of G,,. 


Corollary 8. Let p= w(n)/n=o0(1) and ¢>0 be a constant. Then for a.e. 
graph G, 


(1 — €)nf (n) < T,(p) < (2 + €)nf(n). 


The above is a generalization of a result shown in [8] where a case 
w(n) =e - log n was considered. Also our method of proof is much easier than an 
algorithmic approach applied in [8]. 

Let us notice here that in a case when p is fixed then (see [3]) the sequence 
{T,(p)} of random variables satisfies 

TP) 2 


> 


logn log 1/q 


asn—>x (12) 


in probability, where d = 1/q. (Even a stronger result is true, namely (12) holds 
with probability one and in any mean, see [7] or [9]). It is a strong evidence to 
presume that a result in a similar vain is also correct for a wide range of the edge 
probability p. Thus we state the following 


Conjecture 1. Let p = w(n)/n=o0(1) where w(n)—. Then the sequence 
{T,(p)} of random variables satisfies 


Tp) 
nf (n) 


in probability, where f() is given by (11). 


2 asn—-x 


Also we have found reason to conjecture that every integer between 
(1 + €)nf(n) and (2 — €)nf(n) is likely to occur as the order of a maximal tree of a 
graph G, with p = w(n)/n (i.e. we would have a similar property of G, as in the 
case when p is fixed—see Theorem 1). Thus our second conjecture is as follows. 
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Conjecture 2. Assume that p = w(n)/n = o(1) where w(n)— ©. Then a.e. graph 
G, has property 7, for every k satisfying 


(1+ €)nf(n)<k <(2-)nf(n) 
with f(n) defined by (11). 
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We generalize the concepts of a fragment and an atom of a graph and show that these 
generalizations have properties similar to the common concepts. We prove that a contraction- 
critical, finite graph G has at least |G[/3 triangles and that a finite graph G is 8-connected if 
every complete subgraph of G is contained in a smallest separating set of G. We study some 
further classes of graphs (almost critical graphs, C,-critical graphs) and discuss some 
applications. 


0. Introduction 


Properties of atoms and ends derived in [7] have turned out to be valuable tools 
in studying connectivity of graphs. But sometimes it is inappropriate to consider 
all smallest separating vertex sets and it is necessary to confine oneself to 
separating sets having special properties. So we will generalize these concepts in 
Section 1 and point out which properties of fragments, ends, atoms, and critical 
graphs carry over to these generalized concepts. 

In Sections 2 and 3 we will give some applications of the general results of 
Section 1. Especially, we shall see in Section 2 that some of the results of [2] are 
immediate consequences of the properties of G-atoms and S-ends. Moreover, we 
shall prove that every contraction-critical, finite graph G has at least 4|G| 
triangles, where |G| denotes the number of vertices of G. At the end of the paper 
we shall see that a contraction-cnitical, infinite graph does not necessarily contain 
a triangle, but it must contain one if it is locally finite. In Section 3 we consider 
C-critical graphs, i.e. graphs where every complete subgraph is contained in a 
smallest separating vertex set. We shall show that every C-critical, finite graph is 
8-connected, but I do not yet know if such a graph exists. If there were no 
C-critical, finite graph this would prove a well-known conjecture of Slater [13]. 

First we give some definitions and notation. All graphs considered in this paper 
are supposed to be finite, unless otherwise stated. The edge joining the vertices x 
and y is denoted by [x, y], the degree of x in G by d(x; G), and the subgraph of 
G induced by T < V(G) by G(T). For e=[x, y]e E(G), V(e):= {x, y} and for 
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TcV(G), 


N(T;G):= U V(e) — T. 


e€E(G)AV(e)NT FB 


The set of components of G is denoted by @(G). For a subgraph AcG, 
G-A:=G-—V(A) and also for V'c V(G), V’-A:=V’'— V(A). If no confu- 
sion seems possible, in set theoretic notation and in notation such as N(T; G) we 
do not always distinguish strictly between a subgraph and its vertex set. Z,, means 
the integers mod n. 

We write u(G) for the connectivity number of G. We call a separating vertex 
set T with |T| = u(G) a smallest separating set of G and define &(G):={G(T):T 
a smallest separating set of G}. A vertex x of G is called critical if u(G — x)= 
u(G) — 1. Setting Cr(G):= {x € V(G):x critical in G}, we call a graph G critical 
if Cr(G) = V(G). For T € &(G), a T-fragment is a union of at least one, but not 
all components of G — T. A fragment F of G is a T-fragment for any T € 3(G), 
and this uniquely determined T = G(N(F; G)) is denoted by T,. If F is a fragment 
of G, then F:=G —(F UT,) is a T;--fragment, too, the complementary fragment 
of F. (Sometimes we write more exactly F°.) An inclusion-minimal fragment is 
called an end and one of least order an atom. The order of an atom of G is 
denoted by a(G). 


1. General results 


In some proofs in Sections 2 and 3 we do not consider T-fragments for all 
T € (G), but only for certain ones. So we need some generalizations of the 
results on atoms and ends as proved in [7] and [11]. First we give the necessary 
definitions. 

For a graph G, let S be a non-empty set of subset of V(G) and define 
¥o(G):={T ¢ T(G):there is an SES with ScT}. An S-fragment of G is a 
T-fragment of G for any T € £2(G). An inclusion-minimal G-fragment of G is 
called an S-end of G and one of the least vertex numbers is an G-atom. The 
order of an S-atom of G is denoted by az(G). The graph G is called S-critical if 
and only if every S € S is contained in any T e 2(G) and for every S-fragment F, 
there is a T € 3(G) such that TN F #9 and TN (F UT;) does contain an Se S. 

We give some remarks and examples. Unlike as in the normal use of ‘critical’, 
an S-critical graph is never complete. Hence it has ©-ends and at least one 
S-atom. Of course, an S-atom is an S-end and an S-end is connected. If G 
is S-critical and )s.<3 $=V(G), then G is critical. Usually, S will have the 
property that for every S-fragment F, there is an S € S such that § c F U T; and 
SOF #9. For an © with this property, G is S-critical if and only if every Se S 
is ina Te Z(G). 
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Example I. 


(1) For any graph G and any positive integer k<|G|, define S:={Sc 
V(G):|S|=k}. Then the graph G is S-critical if and only if every set of k 
vertices of G is contained in a smallest separating set of G. Such graphs, as well 
as complete graphs K,, of order n >k, are called k-critical. Hence a graph G is 
k-critical iff |G|>k and u(G — S) = w(G) — k for all Sc V(G) with |S|=k. An 
(n, k)-graph is a k-critical graph of connectivity number n. 

(2) For S:= {6}, an G-critical graph is called almost critical. Including the 
complete graphs, we define a graph G to be almost critical iff FO Cr(G) #6 for 
every fragment F of G. Of course, every critical graph is almost critical, but not 
vice versa (cf. Example II). A non-complete, almost critical graph is 2-connected 
(see Corollary 1). 

(3) For a graph G and a positive integer k, define C,:={V(K):K is a 
complete subgraph of G with |K| =k or a clique of G with |K| <k}. (As usual, a 
clique is an inclusion-maximal complete subgraph.) Then the graph G is 
C,-critical iff every complete subgraph K of G with |K|<k is contained in a 
T « &(G). Hence a non-complete graph is C,-critical iff it is critical, and it is 
C--critical iff it is contraction-critical. 

(4) For a graph G, define C:=C;, for any integer k = |G| where C;, is as in (3). 
Then the graph G is C-critical iff every complete subgraph K of G is contained in 
a T € &(G). I have not yet found a C-critical finite graph. 

(5) For any connected graph G and any positive integer k<|G|, consider 
©:={ScV(G):|S|=k and G(S) connected}. Then G is G-critical iff every 
connected subgraph of order k is contained in a T € £(G). Hence this is also a 
reasonable generalization of ‘critical’ and ‘contraction-critical’. 


Before we consider some of the classes given in I in more detail in Sections 2 
and 3, we will point out in this section which results transfer to the general case. 


Lemma 1 (cf. Lemma 1 in [11]). Assume T,€3(G) for i=1,2 and let F be a 
T,-fragment of G for i=1, 2 such that T.N(R#UT,)UERN(K UT) contains an 
S eS. Then the following statements hold. 
(a) Assume FOF, #6. Then |F.NT)|=|F,NT| and if RNF, is no G- 
fragment, |F,N T,|>|F, T|| holds. 
(b) Assume F,0 F, #9. Then F,A F, = or FOF, is an S-fragment and FO F, 
is a fragment of G. 
(c) If F, is an G-end of G such that F,¢ F, and FOE, #9, then FF, =9. 


For the proof of Lemma 1 we remark only that in case F,. NF, #9, 
T:=FKNTZUTLNBZUBNT2NFO#;G) is a separating set containing an 
SeG, hence G(T) € &<(G) or |T|>u(G). From this, Lemma 1 follows in the 
same way as Lemma 1 in [11]. O 
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The proof of the following two lemmas also is a word-for-word translation into 
the general case of the proof of the corresponding results in [11]. 


Lemma 2 (cf. Lemma 2 in [11]). Let B be an S-end of G and assume there is a 
TeX(G) such that TO B#6 and TN(BUT=s,) contains an Se. Then the 
following statements hold. 
(a) If there is a T-fragment F of G with FO B #9, then BT and |B| <}u(G) 
or F <Tp and |F| <4u(G). 
(b) Bc T or BcT holds or there is a T-fragment F < Tz. 


Lemma 3 (cf. Lemma 3 in [11]). Assume there is an S-end B of G such that 
|B|>4u(G) and |B|=4u(G) and there is a Te 2(G) such that TO B #6 and 
TNM(BUTz,) contains an S€S. Then there is a T-fragment F such that F< T, 
and |F| <4u(G). 


Of course, the 7-fragments F, F in Lemmata 2 and 3 are S-fragments, as 
T €=s(G). We need still another simple property of fragments. 


Lemma 4. Let F be a T,-fragment of G for i=1,2 and assume Fic T,. Then 
I(T | >|Fil or RET, If T,€ Tz(G) and |F| = az(G), then |F,N T,| > |Fil. 


Proof. If F,¢ T,, then FA. F, #9 and by Lemma 1(a), [KN T| =|KN DI =|Al. 
For the second assertion, || =|/{ by definition of az(G) and hence in any case 
I_N T| 2 |Fil. O 


In a similar way as Theorem 1 in [7] follows, Lemmata 1(a), 1(b), and (4) imply 


Theorem 1. Let A be an S-atom of G and assume there is a Te 3(G) with 
TOA#® such that TO(AUT,) contains an SES. Then ACT and |A|< 
1 

2{T — Tl. 


Corollary 1. For every S-critical graph G, a=(G)<4u(G) holds. In particular, 
every ©-critical graph is 2-connected. 


It should be emphasized that even in an S-critical graph G, an S-atom A does 
not necessarily have the following property: 


Te3zs(G) and TNA##>ACT. 


It is not enough that there is an Se S in T, but there must be an Se S in 
TN(AU T,)! So it may happen that there are S-fragments F with FM A #9, but 
AG¢F. 

We search now for a generalization of Corollary 1 of Theorem 1 in [11], which 
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says that every non-complete critical graph G has two disjoint fragments of order 
at most 4u(G). The direct analogue is not true for S-critical graphs. Let us look 
at an example. 


Example II. Choose any integers m2=1 and k2=3. Let G(2i) be a K,,,,, for 
i=1,...,k-1, G(2i+1) a K,,,, for i=1,...,k-—2, G(1) and G(2k-1) a 
K,,, and G(0) a K,,_, such that G(i)N G(j) =® for i#j. The graph G,,,, arises 
from \3*>! G(i) by adding all edges between G(i) and G(i +1) for all i€Z2,. 
Then u(G,,,)=2m and Z(G,,,)={G(0)UG(2i+1):i=1,...,k-—2}U 
{G(1) U G(2k — 1)}. Hence G,,, is almost critical, but not critical, and G(0) is 
the only fragment of G,,, of order at most m. 


For some classes of ©-critical graphs we can get disjoint small fragments from 


Theorem 2. Let A be an S-atom of an G-critical graph G. Then there is an 
©-fragment F disjoint to A and an R<V(F) with |R| <4u(G) such that there is no 
SeS in FUN(F-R;G) with SO(F —R) #86. 


Proof. First we remark that an ©-fragment F disjoint from A satisfies the 
condition of Theorem 2 for R:=V(F) if |F|<4n, where n:=(G). 

Let BcA be an S-end. Define 2’:={T € 3(G): TN B#Q and there is an 
SeS in TN(BUTs,)} and S':={SeS:ScaTN(BUTs) for a T €X’}. Since 
G is G-critical, T’ #6 and S'#§. Consider any T € T’ oc Te(G) and suppose 
there is a T-fragment F with FO B #@. Then by Lemma 2(a), B or F has order at 
most 4n and is disjoint from A. So we may assume BcT for all TeX’ #9. 
Hence 4(G’) =n —|B| for G’:=G — V(B). 

Let B’ be an G’-end of G’. We may assume |B’| > 4u(G’) and |B’? | > 4u(G’), 
since B’ and B’® are also G-fragments of G, which are disjoint from A, because 
AcB. As G' has disjoint G’-ends, we may assume |B’ T,|<4n. Define 
R:=V(B'N Tz) and suppose B’ and R do not have the properties desired. Then 
there is an SEG such that Sc B’ UN(B’—R;G) and SN (B’— R) #9. This 
implies SoBUT, and SOB#§%. G being G-critical, there is a T € (G) 
containing S. Hence TeX’ and SeS’. Then T’:=T—Be Z(G’) satisfies 
T’NB’ #6 and ScT’N(B’UN(B’;G’)), since N(B’—R;G)=N(B’— 
R;G’). Hence we can apply Lemma 3 to G’, ©’, B’, T’, S and get a 
T’-fragment F of G’ of order less than 44(G’). Then F is an S-fragment of G of 
order at most 4n, which is disjoint from A. O 


Corollary 2. Every C,-critical graph G has two disjoint C,-fragments of order at 
most 44(G). 


Proof. Let A be a C,-atom and consider a C,-fragment F and Rc V(F) as in 
Theorem 2. Then F — R =Q, as every complete graph K with KN (F — R) #9 is 
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contained in F UN(F —R;G). Hence |F|<4u(G) and by Corollary 1, |A|< 
4u(G), as well. O 


Corollary 3. Every almost critical, non-complete graph G has disjoint fragments F, 
and F, such that |F,|<4u(G) and |F,N Cr(G)| <4u(G). 


Proof. An almost critical, non-complete graph G is S-critical for S:= {{x}:x 
Cr(G)}. Let A be an S-atom of G and consider an ©-fragment F and R c V(F) 
as in Theorem 2. Then Cr(G)N(F —R)=6 and hence |Cr(G)N F| <|R|< 
4u(G). O 

21 


2. Contraction-critical and almost critical graphs 


Thomassen proved in [14] that every contraction-critical graph does contain a 
triangle. His proof is more or less a proof of Corollary 1 for S:=C2, which 
immediately implies this fact. For let A be a C,-atom of G of order at most 
4u(G). If |A|=1, G contains a triangle, as T, contains an edge. If |A| > 2, then 
every edge of A is contained in a triangle of G. This is implied by the following 
easy lemma, which is used in [14] and explicitly proved in [2]. 


Lemma 5. /f a fragment F of G has an edge which is not contained in a triangle of 
G, then |F| = u(G). 


Proof. If [x, y}e E(F) is not in a triangle, then N(x;G)NN(y;G)=6 and 
N(x, G)UNM y;G)cFUT,. O 


The result of Thomassen means that every non-trivial graph G without triangles 
has an edge which is not in any Te 2X(G). Such an edge [x, y]e E(G) — 
UrexG) E(T) of a non-complete graph G is called contractible. It was proved by 
Egawa et al. in [2] that every graph G of connectivity number n = 2 and of order 
at least 3n without triangles has at least |G|+3n?—3n contractible edges. An 
essential part of their proof follows immediately from Lemma 3. To see this, 
consider a graph G of connectivity number n>2 without triangles and define 
S:={V(e):e € Urexg) E(T)}. We may assume © #9. Every S-fragment F of 
G has order at least n by Lemma 5, as |F| = 1 is impossible. Consider any G-end 
B of G. Then |B| =n and |B| =n holds and by Lemma 3 there is no T € 3(G) 
with TM B #® such that TM (BU Tz) contains an S e S. Hence every e € E(G) 
with V(e) B ¥§ is contractible. Choosing an S-end B’ cB, we get disjoint 
ends B and B’cB such that every edge incident to a vertex of BUB’ is 
contractible. This result was an important step in the proof given by Egawa et al. 
Another crucial step in their proof was the fact that every vertex z of G is 
incident to at least two contractible edges. To see this, define ©,:= 
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{{z, x}:[z, x] € Ureaia) E(T)}, assume S, #9, and consider an G,-end B. Then 
Lemma 3 implies again that all [z, x] € E(G) with x € B are contractible. As there 
is at least one such edge (because z € T,) and there is an S,-end B’ cB, we get 
two contractible edges in z. 

We study now the number and distribution of triangles. in contraction-critical 
graphs. We shall show that every C,-critical graph G has at least 4|G| triangles 
and that every C.-fragment of G contains a vertex which is on a triangle of G. 


Theorem 3. Let G be a C>2-critical graph and assume z € V(G) is not contained in 
a triangle of G. Then there are an x € N(z;G) and an edge [x, y}e€ E(G) such that 
there are at least d(x; G) — 3((G) — 1) triangles containing [x, y}. 


Proof. Denote n:=u(G) and define G:={{z,x}:[z,xJe E(G)}. As every 
©-fragment F has FM N(z;G)#9, the graph G is S-critical, because it is 
C>-critical. Let us consider an G-atom A. Applying Theorem 1, we get 
|A| <4(n -1), since ze T for all Te Ts(G). As z is not on a triangle, but 
T, € &s(G), we have |A| = 2. Hence there is an [x, y]€ E(A) with x € N(z; G). 
Denoting c:=|N(x;G)NN(y;G)|, we get the inequalities d(x;G)—c+ 
d(y;G)-c+c<|A|+|T|<4(3n—1), which implies the claim that c= 
d(x,G)-4(n-1). O 


Remark. For n = 4, we get |A| = 1 for the S-atom A in the proof of Theorem 3. 
Hence in a C,-critical graph of connectivity number 4, every vertex z is on a 
triangle which contains a vertex x # z of degree 4. This was used by Fontet in [3] 
and [4]. 


Theorem 4. Every C>-critical graph G has at least }|G| triangles. 


Proof. Let «(G) denote the number of triangles of G and for e € E(G), let t(e) be 
the number of triangles of G containing e. Define H:=(V(G), {e € E(G):t(e) > 
0}) and J:={zeV(G):d(z;H)=0}. For zeJ, we choose an x:=f(z) in 
N(z;G) as in Theorem 3. Denoting n:=y(G), then |f~'(x)|<4(n — 1) for 
xéX:=f(J) by Theorem 3 and, therefore, Up. yjexcun t([x, y}) = 2(d(x; G) — 
Mn —1))2=n+1>2(|f~'(x)| +1) for x €X by Theorem 3. So we get 

(@@) % dX ale, y)> 21+). 


xeX [x.yleE(1/) 


Since every y € Y:= V(G) — (X UJ) is on a traingle in G, we have also 
(6) XY 2D  elyx))>2I¥1. 


yey [y.xJeE(H) 


Addition of (a) and (8) gives the assertion 
2IGI< } 2D  exy))=6G). O 


xEV(G) [x.yleE(H) 
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One could conjecture that every vertex of a contraction-critical graph is on a 
triangle. But this is not true, as the following example shows. 


Example II. Take any even integer m = 4 and any integer k = 2m + 1. Consider 
k disjoint complete graphs K' of order m, say, V(K') = {ai,..., ai,}. The graph 
R;,.x then arises from f_, K' by adding all edges [a’, ai3}] forh=1,...,4m+ 
1,ieZ,, andj €Z,,. Asm 24, there is an independent vertex set U with |U| =k, 
for instance, {a!,a?,..., a%}. Construct R,,, from R’,, by adding any further 
vertex z and all edges [z,u] for we U. Then u(R,,,)=2m+1 and R,,, is 


C,-critical, but there is no triangle containing z. 


In Section 3 we will need the fact that every C,-fragment of a C2-critical graph 
G has a vertex contained in a triangle of G. For this we need 


Lemma 6. Let B be an independent vertex set of G such that A:=G — B is not 
independent and |B| = |A| 22 holds. Assume d(a;G)=|B| for allaeA and there 
isanaeA with d(a;G)>|A|. Then there is a triangle D in G with DO B#9. 


Proof. By induction on |A|. The claim is obvious for |A| = 2. Suppose |A| > 2. If 
A is disconnected, we can delete a least component of A and by induction get the 
triangle desired. Let us assume that A is connected. By assumption, there is an 
ay€V(A) with d(ay,G)>|A|. There is an a#a,y in A such that A — {a} is 
connected. Since d(a; G) = |A|, there is a b € BM N(a;G). If there is no triangle 
containing [a, b], then G’:=G — {a, b}, B’:=B— {b}, and A’:=A — {a} satisfy 
the conditions of the lemma. Then, by induction, there is a triangle D with 
DOB'#%. O 


Theorem 5. Let F be a C,-fragment of a C>-critical graph G. Then there is a 
triangle D in G with DO F #9. 


Proof. Choose a C,-end BcF. Let us suppose there is no triangle in G 
containing a vertex of B. Then |B| > 2 as E(T,) #9 and hence |B| > u(G)=:n by 
Lemma 5. Define ©:={V(e):ee E(G) and V(e)M B #6} and let A be an 
€-atom of G. Since T, contains an edge incident to an x € B, we have |A| = 2 and 
N(x;G)NA#§. Then {x, z} € S for z e N(x; G)NA #6 and Theorem 1 implies 
|A|<4(n — 1). But then BN A =@ by Lemma 5, since |A| => 2. Hence BNA #9, 
since |B| =n. B being a C>-end, BNA cannot be a C,-fragment. Hence Lemma 
1(b) and (a) implies BN A = and |BN T,|>|A A Tp| = |A|. Let us consider the 
graph H:=G(AU(BN T,))— E(BNT,). Then d(a; H) =d(a; G —(T, — B))= 
n—|T,— B| =|T,0 Bl for allae A. As |T,N B| >|A| =2, we can apply Lemma 
6 to H for V(BMT,) and A and get a triangle D intersecting BN T,, a 
contradiction to our assumption. O 
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We turn now to almost critical graphs. These graphs occur in a natural way; for 
given an atom A of a contraction-critical graph G, G — A is almost critical. This is 
generalized in 


Lemma 7. Let A be an G-atom of an S-critical graph G. If for every x € T,, there 
is an S €S containing x such that SOA#% and SCAUT,, then G — V(A) is 
almost critical and V(T,) < Cr(G — V(A)). 


Proof. Consider G’:=G — V(A) and any x € T, #9. There is an S € © such that 
SNA#G and xeScCAUT,. G being S-critical, there is a T € $(G) containing 
S and AcT by Theorem 1. Hence p(G’)=pu(G)—|A| and x eCr(G’). As 
F'( T, #9 for every fragment F’ of G’, G’ is almost critical. O 


Remark. Instead of the assumption ‘A is an ©-atom’, it is enough to assume in 
Lemma 7 that a non-empty AGV(G) has the property: If TNA #® for a 
TeX(G) and TN(AUN(A;G)) contains an SeS, then AcT. So, for 
instance, it is true that G-—z is almost critical for any vertex z of a 
contraction-critical graph G. 


In Theorem 2 of [11] it was proven that every non-complete, critical graph has 
four disjoint fragments. This is not true for almost critical graphs, as Example II 
shows. A still simpler counterexample is the following graph: For any integer 
n=3, choose a complete graph K of order at least 2n, add two adjacent vertices 
@,, a, and join a; to a set A; of exactly n — 1 vertices of K for i= 1, 2, such that 
|4,UA,|>n. This graph is almost critical and does not have four disjoint 
fragments. Analysing the proof of Theorem 2 in [11], it is easily seen that the 
same arguments even prove the following more general result. 


Theorem 6. Every almost critical, non-complete graph G has fragments F,, Fy, F;, 
F, such that F,, F,, Fy, and F,Q Cr(G) are disjoint. 


Since F 0 Cr(D) #6 for a fragment F of G, F, ¢ F and F, ¢ F, for i = 1, 2, 3. In 
Theorem 6, we can choose F, as an atom of G and get, in addition, that 
F, 0 F,=9. But it is not possible to achieve, moreover, F, F,=9, as Example II 
shows. 

Egawa proved in [1] that a(G) <}(u(G)) for every non-complete, contraction- 
critical graph G. Considering an atom of G, this result follows immediately from 
Lemma 7, Theorem 6, and Lemma 4. 


3. C,-critical graphs 


In this section we will study mainly the connectivity of C,-critical graphs. We 
shall show that C;-critical graphs are 6-connected and that every C-critical graph 
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G is 8-connected and does not contain a complete graph of order w(G) — 2, 
if it is minimally u(G)-connected. Unfortunately, I have not succeeded in 
deciding if there are finite C-critical graphs. This class of graphs is related to a 
well-known conjecture of Slater [13], which says that for all positive integers n the 
only (n, |4n] + 1)-graph (cf. Example I (1)) is K,+;. (Some stronger conjectures 
are found in [10].) Let us verify that there is no non-complete (n, |3n] + 1)- 
graph, if u(G)>n for every C-critical graph G. Suppose G is a non-complete 
(n, [dn] + 1)-graph. If there is no C-critical graph of connectivity number n, 
there must be a complete subgraph K c G not contained in any T € [(G). As G 
is ([4n] + 1)-critical, |K| > |4n] + 1. But G cannot contain such a large complete 
subgraph (see Theorem 1 in [12]). If there were no C-critical graph, therefore, 
Slater’s conjecture would be settled. It is obvious that there is no C-critical line 
graph L(G), because 0 # {[a, x]:=x € N(a; G)} c T € X(L(G)) for any a € V(G) 
would imply the contradiction that T — {[a, b]} is also separating in L(G) for 
beN(a;G). Hence Slater’s conjecture is true for line graphs, as proved in 
[6].-For proving Slater’s conjecture, it would be sufficient to show that every 
k-critical graph does contain a K,,,. This could be true for all k and certainly is 
true for k = 2 by Thomassen’s result, which we generalized in Section 2. But I do 
not know if every 3-critical graph must contain a K,. I do not even know if there 
is any k such that every k-critical graph must contain a K,. 

First, let us consider examples showing that for every positive integer k there 
are C,-critical graphs which are neither C;,,,-critical nor 3-critical. 


Example IV. For positive integers k and m> 3k define H,,(k):=(Z,,, {[x, x + 
k]:xeZ,, and k=1,...,k}). Then u(H,,(k)) = 2k and H,,(k) is C,-critical, but 
neither C,,,-critical nor 3-critical. The graphs H,,,(k) also show that for given k, 
there are C,-critical graphs of connectivity number 2k having arbitrarily large 
order. This is in contrast with the results for 3-critical graphs in [9], where it was 
proved that every (n, 3)-graph has less than 6n? vertices. 


For the proof of the fact that every C-critical graph is 8-connected, we need a 
series of specialized lemmata. 


Lemma 8. Let B be a fragment of a graph G with V(B) = {a, b}. Ifae T € &(G), 
then b € T or V(T — B)< N(b;G). 


The easy proof of this lemma is left to the reader. 


Lemma 9. Let F, be a T;-fragment of G for i=1,2 such that F.N FR +#8 and 
consider BST,NT. Then |F,NT,|=|KNT|+|B)—-|N(B;G)N FNAB or 
V(F, FB) N(B;G). If IN(B; G) NN Bl <|B| <2, then |, BI> |b. Til or 
V(F.O B)= {2} with Nz, G)= VAN B)UV(T,N B)U VN 1). 
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Proof. Suppose R := V(F, NB) — N(B; G) #9. ThenN(R;G)CN(B;G)N FANE 
URN GBUERNTU(TZAT-B) and hence = |T|= n(G)<|N(R; G)| < 
IN(B;G)N KOK +/FAN Gl + 1A T|+|/T% 9 G| —|Bl, which implies the first 
inequality. If |N(B; G)O F,.NA| <|B| <2 and |KNG|=|KNT| (cf. Lemma 
1(a)), then V(F, B)< N(B;G) by the assertion yet proved. Hence V(F,N 
F) = {z} and |N(z;G)|<|R,N BUT,N GUN GI =|T|="(G), proving the 
second assertion. [J 


Lemma 10. Let B be an S-fragment of G(S € 3(G)) with |B| <2, T € Z(G) with 
BcT, and Ce ©&(G — T) such that |CN S| =|B| and |C|=2. Let K’ be a clique 
of G such that K'NCNS#6 and K'NB#®, and assume T'e€ 3(G) with 
T' 2K’. Then BcT'’ and the following two statements hold. 

(1) If C’'NC #6 for C’eC(G —T’), then |C'NT|>|CNT’| and |CNT'|> 
IC’ TI. 

(2) |C’NT|=2 for all C'e S(G—T’), |CNT’|22, and |CNT'|=2 or 
ICNT’|=|C| =1. 


Proof. Suppose B¢ T’. Then |B|=2 and by Lemma 8 we have N(b;G)> 
V(T’—B) for the vertex b€ B—T’. But this is impossible, because T’— B 
contains the clique K’. Hence Bc T. 

Since COSNT’#6 and |CNS|=|Bl, we have |C’NCNS|<|B| for all 
C’eG(G-—T’). As T’NG(CUT) contains the clique K’, there cannot be a 
vertex z with N(z;G)>V(T’N(CUT)). If C’'NC#OH for C’e C(G—T’), 
therefore Lemma 9 implies |C’N T|>|CNT’| and |CNT’|>|C’NT|. Hence 
(1) follows. 

Consider any C’ € @(G — T’). As T’ contains a clique, |C’| = 2. If C’N C #8, 
then C’NT #9 by (1). If C’N C =§, then C’N T #8, because T’ MN C>K'N 
C #§@. Hence C’N T #9 for all C’ e ©(G — T') and T’NC #86. 

For proving |C  T’| 2 2, we may assume C ¢ T’, because |C| > 2. Hence there 
is a C'e @(G—T’) with C’N C #8. But then (1) implies |CN T’|=2, since 
C'NT #8. 

Consider any C’ € ©(G — T’). We may assume again C’ ¢ T. If C’'N C #9, (1) 
implies |C’N T|=2, as CNT’ #§. If C'NC #6, Lemma 1(a) implies |C’N 
T|=|CNT’|=2. Hence |C’N T| 22 for all C’ e €(G — T’). 


If |C|\=2 and C¢T’, then Lemma l(a) implies |CNT’|>2, because 
IC’NT| 22 for all C’e C((G —T’). O 


Lemma 11. Let G be a C-critical graph with u(G) <7 and let B be an S-fragment 
(S€&(G)) isomorphic to Kz. Assume K is a clique in G with KN B+#® and 
KcTeX(G). Then |TNS| <1. 


Proof. We will obtain a contradiction from the assumption |T N S| = 2. 


As T-—B contains the clique K, B&T by Lemma 8 and |C|=2 for all 
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Ce ©(G — T). Hence Lemma 4 implies |CN S|=2 for all Ce ©(G — T). Since 
|TNS|22 and uw(G)<7, there is a CeS(G—T) with |CNS|=2, say, 
V(C/NS)={c,¢é}. This notation c, ¢ may be chosen in such a way that 
N(c;G) > V(B), if there is such a vertex in CNS. Asc € V(S) = N(B; G), there 
are cliques K’ with ce K’ and K'NB#§. Choose such a clique K’ so that 
|K’'N TNS| is maximal. G being C-critical, there is a T’ € £(G) containing K’. 
Applying Lemma 10, we get Bc T’ and 

(a) |CNT’|=2, |CNT’|=2, and \C’ NT|=2 for all C'e ©(G ~- T’). 

Next we prove 

(b) Assume there is a C'e S(G —T’) with C'N C#§. Then |CNT’|=|C'N 
T| =3, TOT’ =B, and u(G)=7. Furthermore, C’ < T and |C’| =2 hold. 

From (a) and TN T’ >B we get 72 |T|2=|C’N T| + 2+ 2, hence |C’N T| <3 
and in the same way |CM T’| <3. On the other hand, (a) and the first part of 
Lemma 10 imply |C’N T| 23 and |CN T’| 23. So |C’NT| =3=|CNT’|. From 
|C'N T| =3 and (a) we get T’N T=B, |C’NT|=2, and u(G)=7. But then 
C’NC =, because C’N C #G implies |C’ N T| > 3 in the same way as above for 
C’. Lemma 1(a) shows C’N C = 9, since |C’N T|<|CNT’|. Hence C’ ¢ T. 

(c) If N(c; GNA TOS #9, then T'NTOS #9. 

Assume there is an x € N(c;G)N TNS. If V(B)c N(c; G), there is a triangle 
D2 {c,x} with DO B#§. If V(B)¢ N(c;G), then by the choice of c, also 
V(B) ¢ N(¢’; G). But this implies V(B) < N(x; G) and there is again a triangle D 
as above. Hence, by the choice of K’, |K’N TNS|2={DNTNS|=1, which 
implies |T’N TN S| > 1. 

(d) CeT’ and 2<|C| $3. 

Suppose C¢ T'. Then there is a C’e G@(G— T’) such that C/N C #9. Then 
C’ cT and |C’| =2 by (b) and Lemma 4 implies |C’N S|> 2, hence C’ c TNS. 
From ceT’ we get N(c;G)NC'#6, hence N(c;G)NTNAS#§. But this 
implies T’1 TOS #§@ by (c), in contradiction to T’N T =B which holds by 
(b).The bounds 2<|C| <3 follow from (a) and |T’N T|=|B| = 2. 

(e) |C|=2. 

Suppose |C| #2, hence |C| =3 by (d). Then there is at most one C’ e &(G — 
T’) with C'NC#, because C’NC#OH implies |C’NT|=|CNT’|=3 by 
Lemma 1(a) and (d). Hence there are C’ €e ©(G — T’) with C’ c T and then there 
is even a C’e ©(G — T’) such that C’ c T and |C’| = 2. Then by Lemma 4 again 
C’cTNS and hence N(c:G)N TNS #9. But then (c) implies |TOT’|= 
{B}+1=3, hence 72=|C|+|TONT’|+|CNT'|26+/CNT"|, thus contradicting 
(a). 

(f) T’NTONS #@and |TNS| =2. 

Since |C| = 2 by (e) and |T ON S| > 2 by assumption, N(c; G)N TNS #9 holds. 
Hence T’N TNS #® by (c). Now suppose |T S|=3. Then 2<|CNS|<7- 
ITA S|-|CNS|<2 by Lemma 4. Hence |CNS|=2 and CeG(G—T). But 
then we can take C instead of C and get also C < T’ and |C| = 2 from (d) and (e). 
As |C’N S| 22 for all C’e G(G — T’) by Lemma 4, we get the contradiction 
72=(SJ=|T’NS|+IC'NS|+|C'NS| 244242. 
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Using properties (a) through (f), we can now complete the proof of Lemma 11. 
Since [TN S|=2 and T’N TNS #6 by (f), we have |C’N TN S| <1 for all 
C'e ©(G — T’). Hence C’ ¢ T and, therefore, C’ NC #9 for all C’e €(G —T’), 
as CcT’ by (d). Furthermore, |C’M T| =2 for all C’ e €(G — T’) by (a), as 
|T’NT|=|Bl +|T’'NTNS|=3. Since |CNS| <3, there is a C’e 6(G—T’) 
such that |C’N CN S|<1. Then C’N TNS #6 and by Lemma 9, C’N C consists 
of a single vertex z with N(z;G) > V(T — C’). Since |TN S| =2, T’'N TNS #8, 
and C’N TNS #9, we get V(B)UV(TNS)cV(T - C’)c N(z;G), which is 
impossible, because the clique K is contained in G(BUTNS). O 


Lemma 11 immediately implies 


Lemma 12. Let G be a C-critical graph with u(G) <7 and let B be a fragment of 
G isomorphic to K,. Then there is no triangle D in G such that |DQ B| =1. 


Now we are prepared to prove the main result of this section. 
Theorem 7. Every finite, C-critical graph G is 8-connected. 


Proof. We assume u(G) <7. Define S:={V(K):K clique of G with |K|=3}. 
By Thomassen’s result [14] (cf. theorem 4), G6 #§. By Theorem S, G is G-critical. 
Choose an G-atom A of G and define §:= N(A; G). By the definition of S, there 
is a triangle D in G(S). By Corollary 1, the assumption, and the definition of S, 
2<|A| <3. 

If |A| =2, then there is an ae A with |N(a;G)N D|=2 and hence there is a 
triangle D’ with |D'’N A| =1, contradicting Lemma 12. Hence |A| = 3. 

By Theorem 5, there is a triangle D with DN. A #®. Choose a triangle D with 
DNA#® so that |D NM S| is as large as possible. Let K be a clique containing D 
and Te %(G) containing K. Then Te Z(G) and AcT by Theorem 1. So 
|ICNS|=3 for all CeC(G—T) by Lemma 4. This implies “(G)26 and 
|T 1 S| <1. Hence |DM S| <1 and by the choice of D, there is no triangle D’ 
with |D’NA|=1. It is easy to see then that A is a triangle and G(DU 
A)-(E(D)UE(A)) consists of three disjoint edges [d,,a,], where V(D) = 
{d,, d,, d3} and V(A) = {a,, a2, a3}. Then S’:=(S — {d,}) U {a,} is a smallest 
separating set of G and B:=A — {a,} is a component of G—S’. As u(G)2=6, 
there is an se S—D. But then s, a,, a2 span a triangle D’ with |D’N B| =1, 
contradicting Lemma 12. UO 


Remark. At the end of Section 2 we saw that a result of Egawa [1] follows 
immediately from Theorem 6. If we had a result similar to Theorem 6 for all 
©-critical graphs, (perhaps) it would be possible to derive Theorem 7 from such a 
result in an analogous way. 


In the next theorem we shall prove that a C-critical, minimally n-connected 
graph cannot contain a K,-2. (A graph G is called minimally n-connected, if 
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u(G)=n, but w(G —e)<n for all ee E(G).) This is not right for C;-critical, 
minimally n-connected graphs as the graphs H,,(3) from Example IV show. We 
give yet another example. 


Example V. For integers m>i21, let G,,; arise from K'U K” by adding the 
edges [xi,x2,,] for A=1,...,i and xeZ,,, where K' and K” are disjoint 
complete graphs of order m with V(K*) = {x/,..., x*,} for k =1, 2. The graphs 
G,,,, are (m+i-—1)-regular and (m +i-—1)-connected. As for m2 2i-—1 and 
j <i, every complete subgraph of G,,,; of order j is contained in a complete 
subgraph of order j + 1, the graphs G,, ; are C;-critical for all m =2i— 1. Hence, 
for all i2=1 and n>max(3i — 2, 2), there are C;-critical, minimally n-connected 
graphs which do contain a K,,4,_,- 


It is well known that a non-complete, minimally n-connected graph cannot 
contain a K,,, for n=2 (cf. [5] or [8]). A C2-critical, minimally n-connected 
graph G cannot contain a K,, because such a subgraph K,, has a vertex z of 
degree n in G (see [8]}) and then for the vertex x e N(z;G) — V(K,,), the edge 
[z, x] cannot be in a Te Z(G). (It is well known that there are no C,-critical 
graphs of connectivity number 2. This follows, for instance, from Theorem 1, 
which shows that a C2-atom of a C>-critical graph G has order at most 
4(u(G)—1).) I do not know if a C;-critical, minimally n-connected graph can 
contain a K,,_,, but this seems unlikely. If minimality is not assumed, there is no 
upper bound for the order of a complete graph contained in a C;-critical graph of 
connectivity number n. This is shown by the following 


Example VI. Take any positive integers k >i and define n:=(k + 1)i. Let H bea 
bipartite graph with bipartition A, B (i.e. A, B is a partition of V(H) into 
independent vertex sets) and let B,,... , B, be a partition of B. Fork =1,...,k, 
the graph H(A U B,) may arise from a complete bipartite graph K;,,.;.,; with 
bipartition A, B, by deleting the edges of a 1-factor. Then N(A’; H) = B for all 
A’cA having at least two elements, hence 


(Z)|N(A’; A)| =k(i4+ 1) ekit+i=n. 


Define H:=HUC, where C is a complete graph with V(C)=A. Then 
d(a; H)=n for all ae A. Let K be a complete graph of order at least k(i + 1). 
For j J, let H; be a copy of H, where A, ¢ V(H;) and Bc V(A)) correspond to 
A and B, respectively. Assume A; A;.=9 for all j#j’, A; V(K)=@ and 
Bc V(K) for all j €J such that lor every Sc V(K) with |S| =i, there is a jeJ 
with ScB. Then the graph G:=KU\),.,H, is n-connected by (Z). It is 
C,-critical, as well, because for every complete subgraph K;, of G with i’ <i, 
there is a vertex z € )., A, such that d(z; G) =n and N(z; G) > V(K;,). 
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Theorem 8. A C-critical, minimally n-connected graph G cannot contain a 
complete subgraph K,,_>. 


Proof. We suppose that G does contain a K,,_,. Let K be a clique containing this 
K,,_2. There is a T € &(G) containing K. By Theorem 7, n = 8. Hence there is a 
b € V(K) with d(b; G) =n by [8]. As |N(b; G) — T| <3, there is a Ce ©(G — T) 
with |CM N(b; G)|=1, say, CN N(b; G) = {a’}. Let K' be a clique containing 
{a'} UN(a’;G)OK. In particular, b € K’. There is a T’ € [(G) containing kK’. 
As |C| = 2 and |C|=2, we get |CNT'|=2, |CNT’|=2, and |C’N T|=2 for all 
C'e€(G —T') from Lemma 10 (2). There is a C' e €(G — T’) with C'N K =9. 
Then C'N T =T — K and |K| =n — 2, as |C'N T| =2. This implies C’'N C = @ by 
Lemma 9, as |CNT'|=2. As C'NT =T — K and K'cT'", of course, C’'N Tc 
K —K'. As N(a';G)N(K — K') =® by choice of K' and a'€ CNT", we see 
C'NC #8. Then |CNT'|>|C'NT| by Lemma 9 and so C'N C = 9 by Lemma 
1(a). This implies C’<cT and hence C'=T-—K. As beT"', there is an 
aeN(b;G)NC'cT—K. Setting N(b;G)—K ={a,a',a"}, it follows that 
N(b; G)NC = {a"}. Starting from C, a” instead of C, a’ and defining K", T” 
analogously, we see as above that T — K is a component of G — T”, as well. But 
then V(T')=N(T —K;G)=V(T") and hence {a',a"}cN(T—K;G). Then 
N(a; G)N {a', a"} #9, say, [a, a'}e E(G). But then b, a, a' span a triangle D. 
There are a clique K* > D and a T* € [(G) containing K*. Then by Lemma 10 
(2) again, |C*NT|>2 for all C* e€C(G—T™*), contradicting the fact that 
|IT-(KUT*)|<1. O 


Slater’s conjecture [13] that there is no non-complete (2k — 1, k)-graph is 
obviously equivalent to the conjecture (S,): Every non-complete, k-critical graph 
is (2k)-connected. Concluding this paper, we will show that for k<3, (S,) 
remains true, if we replace ‘k-critical’ by ‘C,-critical’. (If this were true for all k, 
of course, there would be no C-critical graph.) For k = 1, this is well known (see 
Corollary 1). For k = 2, the assertion follows from Tutte’s construction-theorem 
for 3-connected graphs [15]. For the only contraction-critical graph of connec- 
tivity number 3 is the complete graph K,. (Cf. also the paragraph before Example 
VI.) We now give a proof for k = 3. 


Theorem 9. Every C3-critical graph is 6-connected. 


Proof. We assume n:=p"(G)<5. Without loss of generality, we suppose G 
minimally n-connected. G cannot be C-critical by Theorem 7, hence there exists a 
K,<G and by [8] there is a b € V(K,) with d(b;G)=n. There cannot be a 
complete graph K c T,:=G(N(b; G)) with |K| <2 such that 7, — K is complete, 
because there would be a T € 3(G) containing K U {b}, which would have to 


separate the complete graph JT, — K. As D:=T, 1 K, is a triangle. this implies 
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u(G)=5. Define {d,, d,, d3}:=V(D) and {a,, a.}:=V(T,) — D. First we prove 
(Z) E(T,)=E(D). 


As we have seen above, [a,, a,]¢ E(G), so it is enough to show that the 
assumption [a,, d,]}€ E(G) leads to a contradiction. By C;-criticality, there is 
then a Te Z(G) containing {b, a,,d,}. As a, and D —T belong to different 
components of G — T, there is a C e ©(G — T) such that C/N T, = {a.}. Asa, eT 
and {a,, a2] ¢ E(G), |C| = 2 holds. Let K be a clique of G({a,, d2, d3}) containing 
a>. Then |K|<2, as CAN {d2, d,} #9. By Cy3-criticality, there is a T' e T(G) 
containing {b} UK. As T' separates a, and D-—T' we see d,e€T'. As 
G({b} UK) or G({b, d,} UK) is a clique in G, we can apply Lemma 10 (2) and 
get |C'N T| 22 for all C’ e ©(G —T'), contradicting |T'N T| = |{b, d,}| = 2 and 
u(G)=S. 

The graph G({b, a;}) is a clique of G for i= 1, 2 by (Z). There is a T;€ E(G) 
containing {b,a;} for i=1,2, and TJ, —{a,,a,} being complete, there is a 
C; € C(G — T,) with C,N T, = {a;,,} for ie Z,. As T; contains a clique, |C;| 2 2 for 
i=1, 2. So we can apply Lemma 10 (2) and get |C 7;| =2 for all Ce ©(G — 
T,.1) and ie€Z,. This implies {CN 7;| =2 for all C € @(G — 7,,,) and 7, QT, = 
{b}. Hence C,;¢ T,,1 for i€ Z, by Lemma 9 and so |C,| =2 for i=1, 2. Since 
CNT, = {ap}, D:=G(C,UC,) — {a2} is a triangle. Hence there is a T € T(G) 
containing D. Then C,UC,¢T by Lemma 8, since [a,, a,)¢E(G). Since 
T,0 T,={b}, there is no vertex z with N(z;G)>V(C, UC). So Lemma 4 
implies |C N T,| = 2 for all Ce G(G — T), which is incompatible with |7 AN T,| = 
|C2,;=2. O 


Finally, we will touch briefly upon the infinite case. First, we shall show by an 
example that the results of this paper are not true for infinite graphs, in general. 


Example VII. Let T be a tree regular of degree 4. Assign a quadrangle Q(x) to 
every x €V(T) such that O(x)N Q(y) = for x #y. For every [x, y}eE(T), 
identify an edge of Q(x) with an edge of Q(y) in such a way that every edge of 
any quadrangle is identified with exactly one other edge. This is obviously 
possible. The resulting graph has connectivity number 2 and is contraction- 
critical, but it has no triangles. Hence it is even C-critical and represents a 
counterexample to the conclusions of Theorems 7 and 9 in the infinite case. 


The graph constructed above is regular of degree No. As usual, for locally finite 
graphs, we have a better chance to get results similar to the results in the finite 
case. 


Theorem 10. Every contraction-critical, locally finite, infinite graph G has an 
infinite number of triangles. 
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Sketch of Proof. Choose any zeV(G) and define G:={V(e):e€ E(G) and 
zeéV(e)}. First we prove 


(Z) There is a finite S-fragment of G. 


Consider an ©-fragment F of G such that |F MN N(z; G)| is minimal. There are 
an x € FO N(z; G) and a T € &(G) containing {x, z}. Of course, T is finite, since 
G is locally finite. So we may assume that there is a Ce©(G—T) with 
COF#®. As |N(z; G)NCNF|<|N(z;G)NF|, CONF is not an ©-fragment 
and so CN F =@ as in Lemma 1(b). If CO F #8, then also CN F =9, and hence 
F cTisa finite S-fragment. If CON F =, then Cc 7; is a finite S-fragment. 

By (Z) an G-atom A of G is finite and hence Theorem 1 remains true for A, as 
is easily seen, But then Theorem 3 follows in the same way as in the finite case. 
Theorem 3 obviously implies that there are an infinite number of triangles in G. 

It is no problem to check that also Theorem 5 remains true for locally finite 
graphs, whereas the proof of Theorem 8 works even for all graphs of finite 
connectivity number n = 5. 
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The euclidean dimension of a graph G, e(G), is the minimum n such that the vertices of G 
can be placed in euclidean n-space, R”, in such a way that adjacent vertices have distance 1 and 
nonadjacent vertices have distances other than 1. Let G= K(n,,..., 54:4,,) be a complete 
(s +¢+4u)-partite graph with vertex-classes consisting of s sets of size 1, ¢ sets of size 2, and u 
sets of size =3. We prove that e(G)=s+¢+2u if t+u22, and e(G)=s+r+2u-1 if 
ttusl, 


1. Introduction 


Let U” be the unit distance graph of euclidean n-space, R", i.e. the vertex set of 
U" is R” and two vertices are adjacent if and only if the distance between them is 
1. For a simple graph G, the euclidean dimension of G, e(G), is the minimum 
integer n such that G is isomorphic to an induced subgraph of U”. 

Erdos et al. [2] defined the dimension of a graph G, dim(G), as the minimum n 
such that G is isomorphic to a subgraph of U". Recently, Buckley and Harary [1] 
introduced the term “‘euclidean dimension” in the present sense. 

The euclidean dimension of the complete graph K, is clearly p—1. The 
euclidean dimension of the complete bipartite graph K(p, q) was obtained by 
Lenz (see [2]). 


Theorem A (Lenz). The euclidean dimensions of the complete bipartite graphs 
are as follows: 

e(K(1, 1)) = e(K(, 2)) = 1: e(K(1,n))=2 for n>2,; 

e(K(2, 2)) =2; e(K(2,n))=3 for n>2; 

e(K(m, n))=4 for m,n2=3. 


Buckley and Harary [1] determined the euclidean dimension of the complete 
tripartite graph K(p, q, r). 


Theorem B (Buckley and Harary). The euclidean dimension of complete tripartite 
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graph K(p,q,7r), p <q <r, is given by 


2 fp=q=i, rs2 

3 ifp=q=1, r2=30rq=r=2 
e(K(p, 4,r))=44 ifq=2, r23 

5 ifps2, q23 

6 ifp 23. 


In this paper we present the following general result. 


Theorem C. Let 
G(s, t,u)=K(,...,1,2,...,2,m1,..., Mu) 
ee a 


$ t 


be a complete (s+t+u)-partite graph with n,>=3, s, t, u being possibly 0, 
st+t+u22. Then 


stt+2u ift+u22 


Gee ha Pee een ift+usl. 


2. Proof 


For a nonempty set X in R”, let L(X) denote the affine subspace spanned by _X. 
First we prove the following lemma. 


Lemma 1. Let X, Y be two nonempyy sets in R” such that each point of X is unit 
distance apart from all points of Y. Then 


dim L(X U Y) = dim L(X) + dim L(Y) + ¢, 


where €=0 if L(X)O L(Y) #@ and ¢ =1 otherwise. Furthermore, if L(X) and 
L(Y) intersect, then they intersect perpendicularly at the common center of the two 
spheres, the minimum-radius sphere passing through all the points of X, and the 
minimum-radius sphere passing through all the points of Y. 


Proof. If X is a singleton set then the lemma follows immediately. Suppose that 
X contains at least two points. Take a point y of Y and let z be the orthogonal 
projection of y on L(X). Since 


ly —z||? + |lz —x||?=1 for all x of X, 


z is the center of a sphere that passes through all points of X, and since z is on 
L(X), this sphere must be the minimum-radius sphere that passes through all the 
points of X. Therefore the point z is independent of the choice of y in Y. Thus 
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the vectors zx and zy (x € X, y € Y) are always orthogonal, and we have 
dim L(X U Y) = dim L(X) + dim L(Y U {z}), 
L(X)N L(Y) ¢ L(X) N L(Y U {2}) = {2}. 


Now, if L(X)N L(Y) #9, then z € L(Y) i.e. dim L(Y U {z}) = dim L(Y), and in 
this case z must also be the center of the minimum-radius sphere that passes 
through all the points of Y. If L(X)N L(Y) =, then z is not in L(Y) and 
dim L(Y U {z})=dim L(Y) +1. O 


The following lemma is easy, and its proof is omitted (see e.g. [3]). 


Lemma 2. Let w,,..., Ww, be the vertices of a regular simplex in R" with side 
length 1, and z be the ‘center’ of the simplex. Then 
(s—1) 
Hwy — 24? =. 
(2s) 


The complement of a graph G is denoted by G. The join G + H of two graphs 
G and H is the complement of the disjoint union of G and H. For an integer 
p =2, we define 

_fl ifp=2 
f(P)= |; if p > 3. 


Lemma 3. For p, q =2, the following hold: 
(1) e(G + K,)=e(G)+f(p), 
(2) e(K(p, q))=f(p) + (9), 
(3) e(K, + K,)=s—1+f(p), 
(4) e(K, + K(p, q))=s5+f(p)+f(q). 


Proof. (1) Let m=e(G+K,). Embed G+ K, in U™ as an induced subgraph 
and let X, Y be the vertex sets of G and K, in this embedding, respectively. Then 
since Y is spherical (i.e. all points of Y lie on a sphere), we have that 
dim L(Y) =f(p). Hence by Lemma 1, we have 


m = dim L(X) + dim L(Y) 2 e(G) + f(p). 
(2) Similarly, we have e(K(p, q)) =e(K, + K,)=f(p)+f(q). To prove the 


opposite inequality, take p points x,,..., x, in R/‘?) in such a way that ||x;||? = 4 
and ||x; —x;|| #1 for every i, j. This is clearly possible. Similarly, take q points 
Vis +++ Yq in RY so that || y,||? =4 and || y; — y;|| #1. Then, the p + q points 


(x, 0) i=1,...,p and (0,y) j=l,...,4 


in R/)+f(@) = RI) x RI@ together induce a graph isomorphic to K(p, q). 
Hence, e(K(p, q))<=f(p) + f(q). 
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(3) by (1), e(K, + K,)2e(K,) +f(p)=s—1+f(p). Let m,...,w, be the 
vertices of a regular simplex of side length 1 in R*~' which is centered at the 
origin. Then ||w;,||? =(s — 1)/(2s). Take p points x,,...,x, in Rf) so that 
\|x,||? = (s + 1)/(2s) and ||x; — x,|| #1. Then the points 

(w,,0) i=1,...,5 and (0,x;) j=l,...,p 


in R°~'*f) = R*~' x RI) induce K, + Kp. 

(4) Let n=e(K,+K(p,q)). First we show that n>s+f(p)+f(q). Embed 
K, + K(p, q) = K, + K, + K, in U” as an induced subgraph, and let W, X, Y be 
the vertex sets of K,, K,, and K,, respectively. Then dimL(W)=s~—1, 
dim L(X)=f(p), and dim L(Y)=f(q). Suppose that any two of L(W), L(X), 
L(Y) has a common point. Then by Lemma 1, these three affine subspaces must 
have a common point z, which is the common center of the three spheres passing 
through the points of W, the points of X, and the points of Y. Since 
\|w — ||? =(s —1)/(2s) for all w of W by Lemma 2, we must have ||x — z||? = 
ly —2||?=(s +1)/(2s) for x of X and y of Y. Hence, we have ||x —y||? = 
\|x —z||>+||y —2||?>1, a contradiction. Thus, there must be a disjoint pair in 
L(W), L(X), L(Y). Then, again by Lemma 1, we can see that n > dim L(W) + 
dim L(X) + dim L(Y) + 12s +f(p)+f(q). 

To show the opposite inequality, let w,,...,w, be the vertices of a regular 
simplex of side length 1 in R*~' centered at the origin, and let w; =(w,, 1/V2s) in 
R*'x R= R’*. Take x, and y, as in the proof of (2). Then the s + p + q points 


(W,,0,0)i=1,...,5, (0,x;,0)j=1,...,p, (0,0,y,)k=1,...,4q 


in Rox RY xRI®D = R*KP)*S9) induce K,+K(p,q). Hence n<s+f(p)+ 
fq). O 


Proof of Theorem C. If ¢+u<1, then by (3) of Lemma 3, we have 
e(G(s, t, u))=s —1+t+2u. Now we prove that e(G(s,t,u))=s+t+2u for 
t+u=2 and furthermore, that there is a point set on a sphere of radius 1/2 in 
R‘*‘*™, which induces a subgraph isomorphic to G(s, t, u). The proof is by 
induction on ¢+ u. The case t+ u =2 follows from (4) (and its proof) of Lemma 
3. Suppose that e(G(s, ¢, u)) =s +¢+ 2u and let X be a point set on the sphere of 
radius 1/V2 in R‘*'t™ centered at the origin. We need to show that 
e(G(s,t,u)+K,)=s+t+2u+f(p), p22, and that there is a point set of a 
sphere of radius 1/V2 in R°t't7“*/), which induces a subgraph isomorphic to 
G(s, t,u)+K,. Let Y be a point set on the sphere of radius 1/2 in Rf? 


centered at the origin, which induces a subgraph isomorphic to K,. Then the 
points 
(x,0), (O,y)eR'*™*x RI?) (xeX, yeY) 
are clearly on a sphere of radius 1/2, and they induce together a subgraph 
isomorphic to G(s,t,u)+K,. Hence e(G(s, t, u) + K,) <e(G(s, t, u)) + f(p). 
The opposite inequality follows from (1) of Lemma3. 0 
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Reidemeister—-Schreier method gives generators and relations of a subgroup of finite index of 
a group defined by generators and relations. In this note we introduce a graph-theoretical 
explanation of this method and we show two examples of its application. Let G= 
(a, b | a3, b?, R), that is, G is generated by elements a of order 3 and b of order 2 with one 
relation R = 1. We show that G with R of length at most 22 whose order cannot be computed 
by coset enumeration in [6] is of infinite order. Next let G = (a, b | a',b™, R) with R = W" and 
i,m, n> 1. Baumslag et al. [1] obtained that G has a representation where the images of a, b 
and W are of order /, m and n respectively and that G is infinite if 1//+1/m+1/n<1. We 
prove the latter part of these results by using Reidemeister—-Schreier method. 


Reidemeister—Schreier method 


Let G be a group which is generated by a subset S of G and let H be a 
subgroup of G. Let I be a directed graph with vertex set G/H and arc set 


{(Hx, Hxs, s)|x €G, 5 €S}. 


We consider (Hx, Hxs, s) to be an arc from Hx to Hxs which is labeled by s and 
at the same time to be an arc from Hxs to Hx which is labeled by s~', Fis called 
the Cayley diagram or the Schreier diagram of G/H with respect to S according as 
H is the identity group or not. If H is normal in G, we also call F the Cayley 
diagram of the factor group G/H. 

Now for a spanning tree of F we assign a distinct new letter to each of the arcs 
that are not contained in the tree, and also give an orientation arbitrary fixed. In 
Reidemeister-Schreier method these new letters are the set of generators of H 
(see [S]). Suppose that relations of G with the elements of S as generators are 
given. Then the relations of H are the collection of the following sequences: we 
trace each relation of G from each vertex along the label and we have a sequence 
of new letters and their inverses according to the directions tracing along or 
against the orientations. 

Tracing each relation gives a cycle (a closed walk) in I, and the arcs assigned 
new letters form a basis of the cycle space of I’, considering each of them 
together with the spanning tree. If we assume all the generators of H commute 
mutually, then we have a presentation of the maximal abelian factor group of H. 
Graph-theoretically the cycle space generated by the relations gives the structure 
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of the factor group. In particular its order is infinite if and only if the rank of the 
cycle space is less than that of I. 


Application 


Let G=(a,b |a°, b?, R). The cyclic conjugates or the inverse of a relation 
defines the same group, and so does interchanging a and a™'. So we do not 
distinguish these relations. The length of R is defined by the total number of a*! 
and b in the sequence a*'ba*'b - - -a*'b of the relation R. Matsuyama {6} tried 
to compute the orders of all groups with R of length at most 30 by coset 
enumeration program. Following is the list of the relations R of length at most 22 
that cannot be computed in [6}. 


(ab)", n=6,7,..., 11, 

(aba~'b)", n=3,4,5, 

((ab)*a~'b)? = (ab) ‘(ab)*(b(ab)*b )ab, 
(ab)°a~'bab(a~'b)’aba~'b = (aba~'b)*(bab)~'(aba~'b)*bab, 
((ab)*(a~'b)’y’, 

(ab)’a~'b(ab)’(a~'b)’ab(a~'b)?. 


All but the last two cases are known to be infinite or have factor groups which are 
known to be infinite (see [3, 7]). 


Case ((ab)*(a~'b)’)’. First we note that it is easily seen that generally in 
Reidemeister-Schreier method we do not need to trace a relation b?=1 ona 
diagram by only identifying the two opposite arcs (Hx, Hxb, b) and (Hxb, Hx, b) 
if the two vertices Hx and Hxb are different from each other. Then for the case, 
adding a relation (ab)*=1, we have a truncated hexahedron as the diagram by 
coset enumeration. If we take a spanning tree which contains two arcs of every 
triangle, then the relation a’ gives trivial generators. So the rank of the cycle 
space is easily computed and we have this case of infinite order. 


Case (ab)’a~'b(ab)’(a~'b)’ab(a~'b). Adding a relation (ab)>=1 gives a 
truncated tetrahedron and we have that the order is infinite as above. 


Next let G=(a,b|a'‘,b”, W"). Suppose that the image G of G by the 
representation obtained in [1] is of finite order g. Let H be the kernel of this 
representation. Then in the Cayley diagram I of G which is a 2-regular directed 
graph without loops, clearly there exist g+1 arcs not belonging to a spanning 
tree. Since the image of a is of order / and since a Cayley diagram is afforded by a 
regular representation, we have g/! cycles by tracing the relation a’ in I (cf. [4]). 
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Similarly we have g/m and g/n cycles by tracing b” and W” respectively, and in 
total we have g/!+g/m+g/n cycles. Hence H has g+1 generators and 
g/l+g/m+g/n relations. So if 1/1+1/m+1/n<1, then the number of the 
relations of H is less than that of the generators of H. This implies that G is 
infinite. 


Note added in proof 


Conder [2] determined all the groups (a, b | a’, b”, R) with R of length at most 
24 in detail. 

Except three relations the author is now able to show the finiteness and 
infiniteness of the groups with R of length at most 30. 
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Caccetta and Haggkvist [1] conjectured that every digraph with n vertices and minimum 
outdegree k contains a directed cycle of length at most [n/k]. With regard to this conjecture, 
Chv&tal and Szemerédi [2] proved that if G is a digraph with n vertices and if each of these 
vertices has outdegree at least k, then G contains a cycle of length at most (n/k) + 2500. 

Our result is an improvement of this result. 


Main theorem. Let G be a digraph with n vertices such that each vertex of G has 
outdegree at least k (=2). Then G has a cycle of length at most (n/k) + 304. 


Proof. Our proof is a straightforward refinement of the highly original argument 
of [2]. So we shall simply sketch the proof. We proceed by induction on n. For 
convenience, we write t=n/k and denote the number of edges in a shortest 
directed path from x to y by dist(x, y). As in [2], we may assume that ¢ > 304, and 
that there is a vertex x such that at most 4(m + 1) vertices y have dist(x, y) < 
[4c] + 152. 

First, we define c using the equation (|4¢] + 152)(k-—c)=4(n +1), and 
consider the smallest positive integer d with the following property: 


there is a vertex s such that at most 
d(k—c) vertices y have dist(s, y)<d. (1) 
With s as in (1), we can show that, for all nonnegative integers 1, 


at most (i + 1)(k —c) vertices y have d —i <dist(s, y)<d. (2) 


For later reference, note the following inequalities: 


(302k — 1) _ (150.75)k 


tk>¢> "Cyt ) 


Case 1. For every vertex v with dist(s,v)=d—1, there is a vertex w with 
dist(v, w) < (t/74.3) — 1 and dist(s, w) < d — 3. 
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In this case, we consider the following two subsets of the vertex set of G. 
 Q:={v | dist(s, v) =d —- 1} 
P:= {u| dist(s, u) = d—2 and there are at least k/4.2 edges uv with v € Q} 


First we shall show that there is a subset R of P and a subset S$ of Q such that 
|R| <(7.4/8.4)'° k, |S| < 19 and such that for every u in P — R there is an edge uv 
with v € S. In proving this assertion, we may assume that |Q| > 19, for otherwise 
we could set R= and S=Q. 

By (2) with i= 1 and by (3), we have |Q| < 2k. Since P and Q are disjoint, we 
similarly get |P| + |Q| =|P U Q| <3k by (2) with :=2. Setting |Q|= «ek (O<e< 
2), we have |P|<(3—e)k. Note that if P=, we can simply set R=S=9. 
Therefore we may assume P #9. Thus we have |Q| > k/4.2 by the definition of 
P, and so ¢ 2 1/4.2. 

We can now construct as in [2] a sequence Ro, ..., Rig Of subsets of P and a 
sequence Sy,..., Sig Of subsets of Q such that |R,| <(3- «)(1—- 1/(4.2e))'k, 
|S;| =i and such that R, consists of those vertices u in P for which there is no edge 
uv with v € §;. 

Since maXx14.2<e«<2 (3 — €)(1 — 1/(4.2e))'? = (7.4/8.4)°, we may set R= Rj, and 
S = Sip. 

Next, we consider the following subset of V(G): 


T:={y | dist(s, y)<d—2 and y¢R}. 


We shall construct a graph H by adding certain new edges to the subgraph of G 
induced by T. These new edges uz are added for each vertex u in P — R, while z 
runs through all the vertices for which there is an edge uv with veS and 
dist(v, z)<¢/74.3. Since d< |3r] +152, at most 4(n +1) vertices y in G have 
dist(s, y)<d. If Q =9, there are no edges directed from Z to V(G) — Z, where 


Z:={y | dist(s, y) <d—2}. 


We get the desired conclusion immediately, by applying the induction hypothesis 
to the subgraph induced by Z. Thus we may assume Q #9, and therefore, we 
have |V(H)|=|T| <34n. 

By estimating the outdegree of the vertices in H (following closely the 
argument in [2]), we get 


deg 4 (u)= (1 — 1/4.2 — (7.4/8.4)')k 


for all ue T, where deg 7 (u) denotes the outdegree of u in H. We include here 
the proof for the case ue P — R. If ue P—R, then there is an edge wv with ve S. 
By the assumption of Case 1, there is a vertex w with dist(v, w) < (t/74.3) — 1 and 
dist(s, w)<d—3. Then there are at least k —(7.4/8.4)'%k vertices z in T with 
wz € E(G). (Note: We may assume that u is not among these vertices z. For if so, 
it means that G contains a directed cycle of length at most (¢/74.3) + 1.) Hence 
deg 7, (u) = (1—(7.4/8.4)'"")k = (1 — 1/4.2 — (7.4/8.4)'%)k. Since 0.5/(1 — 1/4.2— 
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(7.4/8.4)'°) + 19/74.3 <1, the rest of the proof corresponds word for word to that 
in [2]. 


Case 2. There is a vertex v with dist(s, v) =d-—1 such that no vertex w has 
dist(v, w) < (t/74.3) — 1 and dist(s, w)<d —3. 
In this case, we consider the following subset of V(G), 


R,:= {y | dist(v, y) <j and d—2<dist(s, y)<d — 1}, 
U := {w | dist(s, w) =}. 

By an argument similar to the one in Case 1, we get 
[Ul +|R1<3(kK—c) (UNR;=9). 


Hence if we set |U| = e(k —c) (Q<e <1), then we have {R,| <(3— €)(k —c) for 
all j, On the other hand, for all sufficiently large /, 


|Rj| S (2 — €)(74.3/75.3)je. (4) 
Consider the smallest j satisfying (4); we claim that 

J <(t/74.3) -1. (5) 
By the minimality of j, we have 

(2 — €)(74.3/75.3)(j — Ie < |Rj-1| < (3 — e)(k—c). 
This, together with (3), implies 


j—1<(75.3/74.3)((3 — €)/(2 — €))(t/150.75 — 1) 
< 2(75.3/74.3)(t/150.75 — 1), 
and (5) follows as t = 304. 
We consider the subgraph F induced by R;-,. By estimating the outdegree of 

the vertices y in F (following closely the argument in [2]), we have 

deg & (y) =k — e(k —c) — (2— £)(74.3/75.3)c. 
Now (3) implies 

deg } (y) >(2-)c/75.3. 
By the induction hypothesis, F contains a directed cycle of length at most 


(2 — €)(74.3/75.3)jc 


(2— 8)c/75.3 + 304 = 74.37 + 304 <¢+ 304. 
ay 3 E 


This completes the proof. O 
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The random graph K,,, is constructed on n labelled vertices by inserting each of the (3) 
possible edges independently with probability p, 0 >p <1. For a fixed graph G, the threshold 
function for existence of a subgraph of K,,, isomorphic to G has been determined by Erdés 
and Rényi [8] and Bollobds [3]. Bollob4s [3] and Karoriski [14] have established asymptotic 
Poisson and normal convergence for the number of subgraphs of K,,, isomorphic to G for 
sequences of p(n)—>0 which are slightly greater than the threshold function. We use 
techniques from asymptotic theory in statistics, designed to study sums of dependent random 
variables known as U-statistics. We note that a subgraph count has the form of an incomplete 
U-statistic, and prove asymptotic normality of subgraph counts for a wide range of values of p, 
including any constant p and sequences of p(n) tending to 0 or 1 sufficiently slowly. 


1. Introduction 


Erd6és and Rényi [8] introduced the random graph model K,,,, in which a 
graph is constructed on n labelled vertices with each of the (3) possible edges 
present with probability p, 0<p <1, independently. Without loss of generality, 
we will label the vertices {1, 2,...,}. We may view the random graph K,,, as 
determined by a set {X(i,j)}, 1<i<j<n, of independent Bernoulli random 
variables with p = {X(i, j)=1)=1—P(X(, j)=0) for all 1<j<j<n, where 
X(i, J) =1 indicates that an edge is present between i and j, and X(i, j)=0 
indicates the absence of an edge between i and j. For convenience, we let 
q=1-p. 

One often views a random graph as a structure which evolves as edges are 
added successively, or as p increases from 0 to 1. Many important properties of 
graphs appear suddenly in this evolution as the probability p crosses a threshold, 
on opposite sides of which K,,, has the property with probability 0 or 1 
asymptotically as n—>~. For most properties of interest, the threshold is a 
function of n which tends to 0 as n—>~. For a given function p(n), a property is 
said to hold for almost all graphs if it holds for a set of random graphs K,,, with 
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probability tending to one as n—> ~. A more complete discussion of the extensive 
literature on random graphs is available in the recent introduction by Palmer [17] 
and research monograph by Bollobas [4]. 

For a fixed graph G, we consider the subgraph count, S,(G), a random variable 
defined as the number of subgraphs of K,,,, which are isomorphic to G. Introduce 
the indicator function notations 
1 if A is a subgraph of K,,, 


sck.ya[t Ais 
Ac Kap) 0 otherwise 


= [| X(e) 


eeE(A) 
where E(A) denotes the edge set of graph A. 


1 if A is isomorphic to G 
ia Gy iG otherwise 


We may express the subgraph count as 


S(G)= > WA~G)I(ACK,,») 


ASK, 


=» "A~G) [] Xe) 
ASK, ee E(A) 

where K,, denotes the complete graph on the vertex set {1,2,...,}. Note that 
we identify a graph with its set of edges, so the approach applies directly only to 
graphs G with no isolated vertices. However, the results can be easily extended to 
graphs which have isolated vertices. 

If G is a graph with k edges, we need only consider subgraphs A with k edges, 
so 


S(G)= S WA~G) TT X€) 


ASK, ee E(A) 
|E(A)I=k 
which has the form 
> w(e, e2; sey ex)h(X(e,), X(e2), cone X(e,)) (*) 
C1, €2,-06) é, distinct 
e,€E(K,) 
in which w(e,,..., €,) is a nonrandom indicator function. 


The primary goal of this paper is to establish that the subgraph count random 
variable S,(G) has an asymptotic normal distribution for all graphs G with no 
isolated vertices, and for a broad range of probability functions p(n). A 
secondary purpose is to introduce statistical methods for the treatment of random 
variables of the form (*), which are called weighted U-statistics, which may be 
relevant to other random graph problems. Previous results on the threshold for 
existence of a subgraph of K,,, isomorphic to G, and on limiting Poisson and 
normal distributions for subgraph counts for sequences p(n) converging to 0 but 
slightly above the existence threshold, are discussed in Section 2. The principal 
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result, stated in Section 3, establishes asymptotic normality of subgraph counts 
when p is constant, and when the sequence p(n) converges to 0 or 1 sufficiently 
slowly. The result fills a substantial gap in the literature on asymptotic subgraph 
count distributions. Examples illustrating the application of our result to stars, 
trees, cycles, and complete graphs are also presented in Section 3. Section 4 
discusses statistical tools for treating sums of dependent random variables, from 
the theory of U-statistics, and adapts them for treatment of subgraph counts. For 
computation of variances of the relevant statistics, counts of intersections of 
isomorphic subgraphs are considered in Section 5. 


2. Previous subgraph count results 


The determination of the threshold for the existence of a given graph G as a 
subgraph of K,,, was the focus of Theorem 1 of Erdés and Rényi [8]. For this 
problem, Erdés and Rényi introduced the concept of a balanced graph. Define 
the degree of a graph G by 


d(G) = |E(G)I/|V(G). 


A graph G is balanced if d(G)=d(H) for every subgraph H of G. Let ADB 
denote that A contains a subgraph isomorphic to B. Erdés and Rényi proved that 
if G is balanced 


0 ifp(n)n'4 50 asn>@ 


bas PIKnp >G]= (i if p(n)n!4© > wasn 


which identifies the threshold function for existence of a balanced graph G as 
no, 

Bollobds [3] generalized this result to arbitrary graphs G. Define m(G) as the 
maximal degree of any subgraph of G. Note that m(G) = d(G) if and only if G is 
balanced. Then for any graph G, 


0 if p(n)n""©—>0asn>@ 


sues P[Knp 2 G]= 1; if p(n)n""O  w as n> 


ie. the existence threshold is n~'”"°. The Bollobds proof uses a rather intricate 
method called grading, but a short elementary proof has been supplied by 
Rucinski and Vince [19] using the second moment method. 

A graph G is strictly balanced if d(G) > d(H) for every proper subgraph H of 
G. This concept plays a crucial role in obtaining asymptotic distributions for 
subgraph counts near the threshold function for existence. Independently, 
Bollobds [3] and Karoriski and Rucinski [15] proved the following: Let G be a 
strictly balanced graph with k edges, / vertices, m=m(G)=k/l, and an 
automorphism group of order a. Let p(n)n”"— c as n—», for some 0<c<~, 
Then S,(G) has an asymptotic Poisson distribution with mean c*/a. 

Research of Rucinski and Vince [19] shows that the factorial moment 
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convergence method for establishing Poisson convergence does not apply if G is 
not strictly balanced. While their result does not prove that Poisson convergence 
is impossible, it strongly suggests that Poisson convergence of a subgraph count 
holds if and only if the graph is strictly balanced. 

For a small range of sequences p(n) above the existence threshold, Karofiski 
and Rucinski [15] established asymptotic normality for subgraph counts. If G is a 
strictly balanced graph, and p(n)n"(©)- o, but for any 6 >0, p(n)n™"(@-e, 
0, then S,(G) is asymptotically normally distributed. 

Novicki [16] treats multiple subgraph count statistics, obtaining multivariate 
normal limiting distributions, and also treats induced subgraph count statistics, 
for constant values of p. 

Janson [13] applies the method of semi-invariants to derive limiting normal 
distributions for induced subgraph counts to deal with graphs for which the usual 
normalization is not valid for certain constant values of p. 


3. Statement of results 


Theorem 3.1. Let G be a graph with no isolated vertices. Suppose that G has k 
edges and | vertices, and that 


np‘ \—> +o 
n'(1—p)—> +@. 
Then 
S,(G) — E[S,(G)] 


(2) Huan Youn 


has an asymptotic Normal (0, 1) distribution. 


Remark 1. Let G’ be a graph with /+/m vertices, m of which are isolated 
vertices, and let G be the graph obtained by deleting the isolated vertices from 
G’. For an isomorphic copy of G on a set of / vertices, there are ("7,') sets of 
vertices that may be added to obtain a copy of G’. Thus, S,(G’) =("7')S,(G), 
and the asymptotic distribution for S,(G’) is easily obtained from Theorem 3.1. 


Remark 2. The subgraph count S,(G) is a sum of dependent random variables, 
for which the exact calculation of the variance may be long and tedious. Our 
method approximates S,(G) by a sum of independent random variables, called 
the projection of S,(G), for which the variance calculation is elementary, 
providing the denominator of the normalized random variable in the conclusion 
of Theorem 3.1. 
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Remark 3. The upper end of the range of normal convergence does not depend 
on the graph G, as the lower end does, and is the best possible bound: If 
n°(1—p)—c for some 0<c<», then (3)(1—p)— 4c, which implies that the 
number of edges of K,, which are absent in K,,, has a limiting Poisson distribution 
with mean 4c . As a consequence, the number of subgraphs of K,, isomorphic to 
G which are not subgraphs of K,,,, properly standardized, has an asymptotic 
Poisson distribution. If n?(1—p)— 0, then almost all graphs are complete, so the 
subgraph count is a deterministic function of n with probability tending to one. 


Remark 4. For a star on d vertices, Theorem 3.1 provides normal convergence 
when p=w,n~4-) where w,— +, Since the star is strictly balanced with 
average degree (d—1)/d, the threshold function for existence of d-stars is 
n~@(4-)) and normal convergence results of Karonski and Rucinski [15] apply 
when p, =w,n 44") where w,— © but w, =o(n°) for any 6>0. The range 
between n~/4- and n-“4—- is not covered by either result. 

For a cycle of length d, Theorem 3.1 applies when p, =w,n~“¢-) where 
w, — , and previous results apply for p, =w,n~! where w,— © but w, = a(n°) 
for any 6 > 0. 

For the complete graph on d vertices, Theorem 3.1 gives 

Dn = Wyn D-H = wn -ME-2004+4)1, 
where w,—®, as the lower bound on the range of normal convergence. 
However, the previous results give normal convergence for sequences near the 
threshold n-~¢~), again leaving a range of values where the asymptotic behavior 
is unknown, Rucinski [18] recently proved that normal convergence holds in this 
range. 


4. U-statistic methodology 


Consider a sequence X,, X2, X3,... of independent identically distributed 
random variables. If A is a symmetric function of k variables, the U-statistic with 
kernel A based on n observations is defined by 


=k 
U, = (") SRE 0, re. Cremer #9) 
{its---ség }EC(n,K) 
where C(n, k) denotes the set of subsets of size k from {1, 2,..., n}. 
U-statistics were introduced by Hoeffding [11] as a generalization of the sample 
mean, and provide a class of unbiased estimators of distributional parameters in 
statistical estimation theory. Many common statistics, such as the sample mean, 
sample variance, and Wilcoxon test statistics, are U-statistics. The strong law of 
large numbers for U-statistics was established by Hoeffding [12] and Berk [1] using 
martingale methods. The central limit theorem was established in the original 
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paper of Hoeffding, and the rate of convergence to normality has been 
investigated by Grams and Serfling [9], Bickel [2], Chan and Wierman [7], 
Callaert and Jansen [6], and Helmers and Van Zwet [10]. U-statistics are useful 
as approximations to other classes of statistics, such as linear combinations of 
order statistics. 

If each term A(X;,,, X;,,..-, Xi,) is weighted by a factor w(i,, i2,...,i%), we 
have the more general form of a weighted U-statistic 


W,, = S w(i, iy, ware gy i, )A(X,,, Xj,5 Sales X;,): 
Cc 


If the weights w(i,, i2,...,i,) take only 0 or 1 as values, the statistic W, 
represents an ‘incomplete’ or ‘reduced’ U-statistic sum. Incomplete U-statistics 
are designed to be computationally simpler than the full sum, based on the 
reasoning that it should be possible to use fewer terms without much loss of 
information. Incomplete U-statistics have been investigated by Brown and Kildea 
[5], who proved asymptotic normality under certain balance conditions on the 
weights. Shapiro and Hubert [20] investigated asymptotic normality for weighted 
U-statistics. 

The asymptotic behavior of a weighted U-statistic may often be determined 
using Hajek’s projection method, approximating the weighted U-statistic by a 
sum of independent random variables. Define W7, the projection of W,, by 


Wr = > E[W, | Xi] - (2 - DE[W,]. 
i=l 
The original weighted U-statistic may be investigated by writing 


W, — E(W,]=(W, — E[W,]] + [W, — Wr]; 


and considering each term on the right side separately. 
The projection W? has the same mean value as W,, so 


ws - E[W,]= y [E[W, |X] - EWI] 


is a sum of independent, identically distributed, mean zero random variables. 
Under appropriate conditions, its asymptotic distribution may be derived by a 
standard central limit theorem. 

We will rely on a central limit theorem formulated for a double array of 
random variables {X,,;}, in which for each n> 1, there are k, random variables 
{X,;,1<j<k,}, where it is assumed that k,—* as n>. Denote the 
distribution function of X,, by F,;, and let 


y 
Ly = E[X,,] 


L,= |S X,,| = Ss Lnj 


i=1 j=l 
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and 


= Var| X| 


Theorem 4.1. Let {X,;:1<j<k,} be a double array in which the random 
variables in each row are independent. If the Lindeberg condition 
1 & 


Sf Cun) de) 0.05 n> 
Tn j=1 J\t—ppl>eo, 


is satisfied for each « >0, then 


pee (Xj _ Ln) 
Tn 


has an asymptotic standard normal distribution. 


Note that the independence is assumed only within rows, but random variables in 
different rows may be arbtrarily strongly dependent. 


To complete the analysis, one wishes to show that the error in the approxima- 
tion, W,, — W;, is negligible, so the asymptotic behavior of W, is the same as that 
of W%. This is often accomplished by the use of moment inequalities, such as 
Chebyshev’s Inequality. If Var(W, — W?) =0(Var(W;)), then by Chebyshev’s 
inequality 

(im - Wal :)< 2 Var(W,, — =Wr) 
Var(W*) e? Var(W;) 
for every € >0, so (W, — W*)/VVar(W3) converges to 0 in probability, and the 


error in the approximation by the projection is negligible. Then the asymptotic 
normal distribution is obtained by ii 


W, — El] W2- ELM], We WE 9 veg 1 
Vvar(W*) ov) +e a moot 


by applying then central limit theorem to the projection term. 
To aid in computing the variance of W,, - W7, we now show that it may also be 
written in the form of a weighted U-statistic. Without loss of generality, we 


assume that E[h(X,,..., X,)] =0. The projection W; is given by 
wi, ae ix)[g(Xi,) pets + 9(X;,)] 
{ty,..68 izyeC(n,k) 
where 
B(x) =E[A(X, ... XK vy Ax) [X= x}; 


Thus, we have the weighted U-statistic representation. 


W,, i Wi, = > wi, i, Ssh <9 1, )WX,,, Xi sees Xi,)) 


C(n.k) 
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where k 
w(X,, Xo... Xe) =A(X, Xz... , Xe) — D 9(X). 
i=l 


The variance of W, — W* is a weighted sum of expected values of products of the 
form 


W(X, se a X,, U(X: scenes 9 X;,): 
Note that if {i,,...,i,} and {j,,...,j,} are disjoint, the expected value of the 
product is zero, by independence of the sets of random variables. In addition, if 
there is only one index in common, the expected value is still zero. To see this, 
compute 
E[w(Xi,, ie J Xi, V(X, Xj, aaa X;,)] 
= E(E[w(X,,,... 5 Xi) W(X, Xj. Xi) | Xj, Xj +s Xj, |] 
=E[wW(X,,, Xi ---, Xj JELW(X,,, Xin - Xi) | X,,]] 
and use 
E[p(X,,, Xj tay Xi,) | X;,) 


= E[h(X;,; sees X;,) — g(X;,)- cea — 8(X;,) | X;,] 
=E[h(X,,,.--, Xi,) | Xa] - 8(%,) 


Thus, only terms with two or more indices in common make a contribution to the 
variance of W, — W;>. 

By the Cauchy—Schwartz inequality, the contribution of any non-zero 
term is no greater than E[wy(X,,...,X;,)’], which is bounded by 
(k+1PE[A(X,,..., X,)]- 


5. Application to subgraph counts 


Let G be a graph with k edges and / vertices, none of which are isolated, and 
consider the subgraph count 


S,=S(G)=  WA~G) I] Xe). 
ASKn e€ E(A) 
{E(A)|=k 
By subtracting the mean, we obtain 
$,(G)- E[S(G= Y WA~G) TT xe)-p'} 
AcK, ee E(A) 
1E(A)l=k 


which is a weighted U-statistic with kernel 


k 
A(X,,---, X) =[] X:—p*, 
i=1 


Subgraph counts in random graphs 307 
which has mean zero. The corresponding conditional expectation is 


k 
a(X)=E| [TX |X] -2" 
i=i 
=p*X, —p*, 
so the projection S? has the form 


Sr = 3 a,(p*"'X, — p*) 
eeK, 
where a,, denotes the number of subgraphs of K,, which contain a fixed edge and 
are isomorphic to G. 

To compute a,, note that there are (7= 2) different sets of / vertices containing 
the endpoints of the fixed edge. Thus, a, = (7— ?)b, where b, denotes the number 
of subgraphs isomorphic to G on a set of / vertices [which without loss of 
generality we take to be {1,2,...,/}] with an edge between a fixed pair of 
vertices. 

Let C(n, G) denote the set of labelled subgraphs of K, which are isomorphic to 
G. To find b,, we count the number of edges in C(/, G) by two different 
procedures. First, there are (3) pairs of vertices, each having an edge in b, 
subgraphs, for a total of (4)b, edges. Second, since there are /! orderings of the 
vertices, there are /!/a different subgraphs isomorphic to G, where a denotes the 
order of the automorphism group of G. Since each of these subgraphs has k 
edges, there is a total of k(/!)/a. Therefore, 


"ae 
PA 


x n-2 2k fat k-ly _/_7k 
si= 5 (75) ¢-21 ex. -P. 


eeK, 


so 


Since the summands in S¥ are independent and identically distributed, 
n n—2\ 2k 
ar(S?) 5) VaN\ > Pau 2)! p 


n\(n a 5 ap 

= — (@-2)!Pp*-?pa - 5. 

(5) 5) S102)" -p) (5.1) 
= O(n" ?p*"\(1—p)] asn>~. 

Each of the summands in S* is bounded by 


B, = (es) ate — 2)! max{p*~' — p*, p*}. 


If B, =oVVar(S7), then the Lindeberg condition in Theorem 4.1 is satisfied. If 
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p is a constant independent of n, then B, = O(n'~*) and Var(S7) = O(n”~’), so 
the Lindeberg condition is satisfied. If p(n)—0, B, =«VVar(S;) if and only if 


n'2pk-'V/1 = p - 1 ah 
nip ky i —p — np} 


i.e. 
pn’, 
If p(n) 1, B, =eV Var(S;) if and only if 
n'~2y* 1 


r = 0 
np iV1—pnvVi =. 


i.e. (1—p)n’?—>. When both of these convergence conditions hold, by the 
Lindeberg—Feller central limit theorem [Theorem 4.1], we have an asymptotic 
standard normal distribution for 


Si = E{S,] 


(22) Bua Voy 


To complete the proof of Theorem 3.1, we next show that Var(S, — S7)= 
o(Var(S*)). This, by Chebyshev’s inequality, implies that the error in the 
approximation of S, by S; is negligible. By the discussion in Section 4, S,—S? 
may be represented as 


> MA ~ G)Q(Xi,,-- 1 Xi) 


(Be, igha{h...., n} 


wy kd= (TX e'}- {3 ox 2}. 


The terms in S, — S* are uncorrelated unless there are two or more indices in 
common, which is equivalent to the corresponding subgraphs having two or more 
common edges. 

For d=2,...,k, define 


fa = |{((A, B):|E(A)N E(B)| =, A, Be C(n, G)|. 


i.e. f, is the number of pairs of subgraphs isomorphic to G with exactly d common 
edges. 

To compute f,, decompose the set according to the number of common 
vertices, defining 


fal) = |{((A, B):|E(A) N E(B)| = 4, |V(A) N V(B)| =i; A, BE C(n, G)}| 


for i= 3, 4,...,/ (since if A and B share two or more common edges, they have 
at least 3 common vertices). Then, 


fa= 2, Jali): 


Subgraph counts in random graphs 309 


For each i= 3, 4,...,2, we choose the i common vertices and / —i additional 
vertices in each of A and B, so 


fa(i) = & aioe Jed, 


where e,(i) denotes the number of pairs (A, B) which can be obtained on two 
fixed sets of vertices V,=V(A) and V,=V(B) such that |V,NV,|=: and 
|E(A)N E(B)|=d. Since e,(i), i=3,4,...,k, is a sequence of constants 
independent of n, we find that 


nee Nea 


= O(n) 
Therefore 
k 
eS fa= On is | 
d=2 
Since 


Var(S, — S*) < (> n)elwex vee X)] 


< O(n" (k + 1PE[A(X,,..., X,)]) 
= O(n —3 pra p*)), 
we have 
Var(S, — S7) 


Var(S*) ( eee 


nt? kA —p) 


1-p* 1 
1—p np 


Ml 
Ss 


if np‘~'> ~, 

Thus, under the hypotheses of Theorem 3.1, the projection is asymptotically 
normal and the error in the approximation is negligible, so the conclusion follows 
by the general theory discussed in Section 4. 
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1. Introduction and terminology 


All graphs in this paper will be finite and connected and will have no loops or 
parallel lines. 

Let n and p be positive integers with n < 4(p — 2) and let G be a graph with p 
points having a perfect matching. Graph G is said to be n-extendable if every 
matching of size n in G extends to a perfect matching. 

The concept of n-extendability for bipartite graphs was studied by Hetyei ((9]). 
But the study of the more general family of n-extendable graphs which are not 
necessarily bipartite seems to have even earlier roots. In the late 1950s, Kotzig 
[10-12] began to develop a decomposition theory for graphs with perfect 
matchings, but unfortunately these papers did not receive the attention that they 
deserve, due to the fact that they were written in Slovak. In the early 1960s, the 
study of decompositions of graphs in terms of their maximum matchings was 
begun by Gallai (7, 8] and independently by Edmonds [4]. One of the degenerate 
cases of their theory for maximum matchings, however, arises when the graphs in 
question have perfect matchings. 

Motivated by these results of Kotzig, Hetyei, Gallai and Edmonds, Lovasz [13] 
extended and refined the canonical decompositions already extant. 

In this same paper, Lovasz also introduced the concept of a bicritical graph. A 
graph G is said to be bicritical if G — u — v has a perfect matching for every pair 
of distinct points u and v in V(G). In the last ten years or so, the earlier work on 
decompositions of graphs in terms of their matchings has evolved further (see 
Lovdsz and Plummer ({15]) and today much attention continues to be focused 
upon the structure of bicritical graphs which are, in addition, 3-connected. Such 
graphs have been christened bricks. (See, for example, the paper by Edmonds et 
al. [5S] and that of Lovasz [14].) 

But what is the connection between n-extendability and bicriticality? One of 
the results presented in Plummer [16] states that every 2-extendable graph is 
either bipartite or is a brick. (The reader should convince himself immediately 
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that these two classes of graphs are disjoint.) Motivated by this result, the author 
has continued to study properties of n-extendable graphs (see [17—20]}). 

Let S be a point cutset in graph G and let c(G —S) denote the number of 
components in G — S. Then, if G is not complete, the toughness of G is defined to 
be min |S|/c(G — S) where the minimum is taken over all point cutsets S of G, 
whereas we define the toughness of K, to be + for all n. We denote the 
toughness of G by tough(G). We will also say that graph G is k-tough if 
tough(G) =k. This parameter was introduced by Chvadtal [2, 3] who was initially 
motivated by studies about Hamiltonian cycles in graphs. He noted, however, in 
{2} that every 1-tough graph with an even number of points has a perfect 
matching. 

A generalization of both the concepts of Hamiltonian cycle and perfect 
matching is the idea of a k-factor of a graph. A k-factor of a graph G is a 
spanning subgraph of G which is regular of degree k. Thus a perfect matching is 
just a 1-factor and a Hamiltonian cycle is just a connected 2-factor. Chvatal 
conjectured in [2] that if G is any graph on p points and if k is a positive integer 
such that G is k-tough and kp is even, then G has a k-factor. This conjecture has 
only recently been settled in the affirmative by Enomoto et al. [6]. 

In the present paper, we wish to treat some relationships between toughness of 
a graph and the n-extendability of the graph. In the next section we will prove 
two results. The first says essentially that if a graph has sufficiently high toughness 
(and has an even number of points) then it must be n-extendable. The second 
result applies to graphs with toughness less than one and presents an upper bound 
on the value of n for which such a graph can be n-extendable. 

In the final section, we compare and contrast these results with the n-factor 
results of Enomoto et al. 

Any graph terminology used, but not defined, in this paper may be found 
either in Bondy and Murty [1] or Lovdsz and Plummer [15]. 


2. Two results on toughness and n-extendability 
In addition to the theorem of the author [16] mentioned in the Introduction, 
there are two other results proved in that paper which we shall use here and 


hence we begin by stating them without proof. 


1980A. Theorem. If n=2 and G is n-extendable, then G is also (n—1)- 
extendable. 


1980B. Theorem. If G is n-extendable, then G is (n + 1)-connected. 


Our first result of the present paper follows in a straightforward way via Tutte’s 
classical theorem characterizing graphs with perfect matchings. 
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2.1, Theorem. Suppose that G is a graph with p =|V(G)| points with p even. Let 
n be a positive integer with p2=2n+2. Then if tough(G)>n, graph G is 
n-extendable. Moreover, this lower bound on tough(G) is sharp for all n. 


Proof. First suppose that n =1. Note that since tough(G)>1, graph G has a 
perfect matching by Tutte’s Theorem on perfect matchings. 

Now suppose that for some line e=xy e€E(G), line e lies in no perfect 
matching for G. Thus if G’=G-—x-—y, by the above-mentioned theorem of 
Tutte there is a set S’ c V(G’) with |S’|<c,(G’ — S’). (Note that here c,(G' — 
S’) denotes the number of components of G’ — $’ which have an odd number of 
points.) But then by parity, |S’|<c,(G’ — $’) —2. 

Now let 5,=S’U {x, y}. Since G has a perfect matching, it follows that 
Co(G — So) < [Sol = |S’] +2 <c,(G’ — S’). But G-—S,=G’—S" and so equality 
holds throughout and in particular, c,(G — Sp) = |Sp|. (See Fig. 1.) 

But now 


mn SL LS 

tough(G) = mn AG =5) GS) (eG =&) 
contradicting the hypothesis of this theorem. So the desired conclusion holds 
when n = 1. 

Now suppose n>2, and assume that G is not n-extendable. Let M= 
{e,,...,,} be a matching of size n which does not extend to a perfect matching. 
Denote e;=x,;y, for i=1,...,n. Let G=G-—x,--++-xX,-YW- Np 
Hence G, has no perfect matching and thus by Tutte’s Theorem, there is a set 
S,< V(G,) such that |S,|<c,(G,—S,). Hence by parity, |S,|<c,(G, — S,) —2. 
(Note that S, might be empty.) 

Now G, = G —x,, — y, has a perfect matching since we have already proved that 
G is 1-extendable. Let S, = $,U {x,,...,X,-1, ¥iy +--+» Yn—-1} and note that once 


OOOOO0O 


the odd components 
of c'-s' 


Fig. 1. 
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again by Tutte’s Theorem, c,(G2— S,) <|S,| = |S,| + 2n —2<c,(G, — S,) + 2n - 
4. But G, — S$, =G, — S, and so it follows that c,(G,— Sz) = |S,| +2. 

Now let §,=5S, U {x1, oes XpnrViy--- iVigk: Then, since G —-$,= G,- $3, we 
have 


; \S| $3] [Ss] 
h(G)< Go eG el 
tough(G) < ese -S) c(G-S;) c(G,— S$) 
Ssh S| + 2n 


< 


~ €o(G2— S) Co(G2 - $2) 


|S,|+2n n|S,|+2n 
<——— <———— =n, 
|S,| +2 |S,| +2 


again a contradiction of the hypothesis. 

It remains only to exhibit extremal graphs for each value of n >1. For each 
positive integer n = 1, define graph H,, as follows. Join each of the 2n points of a 
copy of the complete graph on 2n points, K$!), to each point of two disjoint 
copies of the complete graph K>,,,,. (See Fig. 2.) Then |V(H,,)| = 6n + 2 and it is 
easy to see that tough(H,,) =n. However, if N is any set of n independent lines in 
KS), then N does not extend to a perfect matching. O 


Now let us begin to think of some type of converse to the above result. Let us 
remark at the outset that if a graph G is n-extendable, there is no lower bound on 
the toughness of G. To see this, we construct the following family of graphs. Let n 
and k be any two positive integers. Let S be a set of 2n independent points and let 
graph J(n, k) be constructed by joining each point of set S to both endpoints of 
each of 2n + k independent lines. (See Fig. 3.) 

It is easy to verify that J(n, k) is n-extendable for every value of k. Clearly, 
tough(J(n, k)) < 2n/(2n +k), and hence tough(J(n, k))— 0 as k>. Of course 
the number of points in graph J(n, k) is quite large and it makes sense to amend 
our search for some type of converse to Theorem 2.1 as follows. Again letting 
p=|V(G)|, we may ask if there is a function f(p) such that if graph G is 
f(p)-extendable, then G is, say, 1-tough. The next result shows that the answer 
to this question is “‘yes”’. 


EOS 


Fig. 2. The extremal family {H,}7.,- 
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Fig. 3. The extremal family (J(n, k)}7 1 4<1- 


2.2. Theorem. Let G be a graph with p points and let n be a positive integer. 
Suppose that G is n-extendable, but that tough(G)<1. Then n<|p —2/6| and 
this bound is sharp for each n. 


Proof. Since tough(G) <1, there is a cutset S in G such that G—S has more 
than |S|=s components. Note that by Theorem 1980B, s=n+122. Let the 
components of G— S be C,,..., C,4,, where r>1. 


Note that G — S$ must have at least one even component, for if not, by Tutte’s 
Theorem, G could not have a perfect matching, contradicting the hypothesis of 
the present theorem. So suppose that component C, is even and hence 
|V(C,)| 22. Since n =1, G is 2-connected by Theorem 1980B, and hence there 
exists a line e, joining a point of C, to a point of S. By hypothesis, G is 
n-extendable and n = 1, so by Theorem 1980A, G is 1-extendable. So extend line 
e, to a perfect matching F, of G and note that by parity, F, matches at least two 
points of component C;, into set S. It then follows that, in fact, G — S must have 
at least 3 even components. 


Claim 1. G — S has at least n even components. 


If 1<n <3, we are done. So we may suppose that n > 4. 

Suppose, to the contrary, that G—S has ¢ even components, where 3<¢< 
n—1. Relabeling these components if necessary, suppose that C,, ..., C, are the 
even components of G—S. (Recall that altogether, G—S has st+r2=s+12 
n +2 components.) 

We now construct a matching which contains two lines joining each of the 
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components C,,..., C, to different points of S. Let e, be any line joining C, toa 
point u, in S, relabeling the points of S if necessary. Now if all lines between S 
and C, are incident with point u,, it then follows that u, is a cutpoint of G, a 
contradiction of the fact that G is 2-connected. Hence we can match points of C, 
and C, via lines e, and e2 to distinct points u, and u, of S say, where once again 
we relabel the points of S if necessary. 

Recall that n => 4. Suppose further that C,,...,C,, g<n, have been matched 
into S to points u,,..., u, respectively. If we cannot match a point of V(C,.,) 
into S at a point different from u,,...,u,, then {u,,...,u,} is a cutset of G; 
that is, it separates C,,, from all the other components of G—S. Hence 
k(G)<q<n, contradicting the fact that (by Theorem 1980B) graph G is 
(n + 1)-connected. Thus we have the matching of size n that we seek. Call it M,. 

Extend matching M, to a perfect matching F, of G. By parity, for each even 
component C,,...,C,, matching — must match at least one point to S other 
than that matched by M,. Without loss of generality, let us suppose that a point 
of C, is matched to point u,,,,..., and that a point of C, is matched to point 
U,+, (See Fig. 4.) 

But now each of the odd components C,,,1,..., C,,, must contain at least one 


Fig. 4. 
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point which is matched by perfect matching F, into the set {u,,4,41,..., Us} of S. 
Thus it follows that s+r—n<s—(n+18) and so r<-—t<0O, a contradiction and 
Claim 1 is proved. 
Finally, we prove 


Claim 2. Graph G — S has at least 2n + r even components. 


By Claim 1, graph G—S has at least n even components. Relabeling if 


necessary, suppose that they are C,,...,C,,..., C,4,. Just as in the proof of 
Claim 1, since G is n-connected, we can find a matching M, which joins exactly 
One point of component C; to a point u; in S for i=1,...,n, where yet again, we 


renumber points in S if necessary. 

Since G is n-extendable, let us extend matching M, to a perfect matching F, of 
G. By parity, each of C,,..., C, has at least 2 points matched into S by F,. So 
relabeling again if need be, assume that F, also matches a point u,,, to a point in 
component C; for i=1,...,n. (In particular at this point, we now know that 
|S| =s =2n.) 

Thus F; must match the remaining s — 2n points of S (if any) to some s — 2n 
points in }2;V(C,). So among the components C,4,,...,C,4,, at least 
(s+r—n)—(s—2n)=n+r must be even. These components, together with 
C,,..., C, give the 2n +r even components of G — S as claimed. 

Now we have 


IV(G)| =p =|S| + |V(C,)| +--+ +1V(C,.,)| 
2s+2(Qn+r)+(s+r—(2n+r)) 
=s+4n+2r+s—2n=25+2n+2r26n + 2r=6n +2. 


So n <(p —2)/6 and since n is an integer, n < |p — 2/6]. 

To show that the bound is sharp for all n =1 consider the infinite family of 
graphs L, where L, =J(n,1) and J(n,1) is shown in Fig. 3. Note that 
p=(|V(L,)| =2n + 2(2n + 1)=6n +2 and hence n =(p —2)/6 and it is easy to 
check that graph L,, is n-extendable, but not (m + 1)-extendable. O 


Of course, Theorem 2.2 can be restated as follows: if graph G is (|p — 2/6] + 
1)-extendable, then G is 1-tough. 
3. Comparisons with an n-factor theorem 

Enomoto et al. [6] have proved the following result. 


1985. Theorem. Let G be a graph with at least n+1 points and suppose 
tough(G) =n. Then, if n|V(G)| is even, G has an n-factor. 
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This theorem answers in the affirmative a conjecture of Chvatal. In order to 
properly compare the conclusion of this result with that of our Theorem 2.1, let 
us try to state each result in as parallel a fashion as possible. Of course, if we 
were to define a graph to be ‘‘0-extendable”’ if it had a perfect matching, the two 
conclusions would say exactly the same thing when n = 1. 

Now suppose n =2 and consider the following two statements; the first being 
the result of Theorem 1985 and the second, our Theorem 2.1. 

(A) tough(G) =n >G has an n-factor. 

(B) tough(G) = n> G is (n — 1)-extendable. 

We claim that the two conclusions are independent, in that neither implies the 
other. 

First consider the family of graphs J(n, 1) already discussed above. Suppose 
n=2. We already know that graph J(n, 1) is n-extendable. Hence by Theorem 
1980A it is also (n — 1)-extendable. We claim it has no n-factor. 

Suppose, to the contrary, that J(n, 1) does have an n-factor, F. Then factor F 
must send 2n” lines from set S to G—S. But each point of G — S must send at 
least n — 1 lines to set S and hence we have at least (n — 1)(4n + 2) =4n*?—-2n-2 
lines of factor F from G — S to S. But then 2n? > 4n? — 2n — 2 and it follows that 
n=1, a contradiction. 

Finally, consider the infinite family of graphs {M,,};-4 constructed as follows. 
Graph M,, is formed by taking two copies of the complete graph K,,,, and joining 
corresponding points of the two copies with a perfect matching. (This is, of 
course, just the prism over K,,,.) Graph M, clearly has an n-factor consisting of 
precisely two components; namely, the two copies of K,,,. On the other hand, 
we claim that graph M, is not (nm — 1)-extendable. Let the points of the “top” 
K,,4, be u,,..., Un, and the corresponding points of the “bottom” K,.,, be 
Vi,.--5 Una. (That is, ujv;, i=1,...,2+1, is the perfect matching joining the 
top and bottom.) 


4 5 
Fig. 5. Extremal graphs M, and M.. 
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In order to prove our assertion, let us consider the cases for n odd and even 
separately. 

First suppose n is odd and n=5. Select the matching M, consisting of u,v,, 
Uju,, U3u,-,,... together with line v,v,.,. Clearly M, is a matching of size 
n—1, but it cannot be extended to a perfect matching for point u,,, could never 
be matched in such a extension. 

Now suppose n = 3. Let M; consist of two disjoint copies of K,4 joined by three 
independent lines. Clearly, graph M; is not 2-extendable, but it has a 3-factor. 

Now suppose n is even. First let us suppose also that n >= 4. In this case, select 
the matching M, to consist of u,u,, U2u,_-,,... together with line v,v,,,. Then 
matching M, has size n — 1, but it cannot be extended to a perfect matching since 
point u,,,, could never be matched. 

Suppose n = 2. Let M, be the graph K,—e for any line e in K,. Clearly M, is 
not 1-extendable, but it has a 2-factor. 

So in summary, the graphs {M,,}, for n =2,3,... have the properties that M,, 
has an n-factor, but is not (n — 1)-extendable. 

We show the graphs M, and M, in Fig. 5. 

The author is indebted to the referee for a careful reading of this paper and for 
his helpful comments. 
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If G denotes a graph of order n, then the adjacency matrix of an orientation G of G can be 
thought of as the adjacency matrix of a bipartite graph B(G) of order 2n, where the rows and 
columns correspond to the bipartition of B(G). Fora a graph H, let k(H) denote the number of 
connected components of H. Set m(G)= min{k(B(G)): G an orientation of G} and M(G)= 
max{k(B(G):G an orientation of G}. R.A. Brualdi et al. [1] introduced these ideas and, 
among other results, proved that for a connected graph G of order n, m(G) = M(G)=n + 1 if 
and only if G is a tree. We prove an intermediate value theorem for k(B(G)) and investigate 
the minimum and maximum number of edges possible in a graph G of order n for fixed 
k(B(G)). In particular, we treat the case when G is complete, so that G is a tournament. 


1. 


If G is a graph, then G will denote a digraph obtained by orienting the edges of 
G and will be called an orientation of G. For example, if G is K,, the complete 
graph with n vertices, then G is an n-tournament. Og will denote the set of all 
orientations of G, V(G) and E(G) will denote the vertex set and edge set, 
respectively, of G, and k(G) will denote the number of connected components of 
G. If Ge Og, V(G) = V(G) = {u;,..., Un}, and A is the adjacency matrix of G 
(i.e. ay=1, if u;—>u; in G, and a, =0, otherwise), then B(G) denotes the 
bipartite graph with bipartition V ={v,,...,U,} and W={w,,...,w,} where 
v; is adjacent to w, if a;=1, 1<i, j<n (ie. uj uy, in G). This paper concerns 
values of k(B(G)) for various orientations G of a graph G and is motivated by 
questions raised by Brualdi et al. {1} in their treatment of these ideas. 

Let m(G)= min{k(B(G)): GeO} and M(G)= max{k(B(G)):G € Oc}. 
Brualdi et al. [1] proved that if G is a connected graph with n vertices, then 
M(G) <n + 1 with equality iff G is bipartite, and G is a tree iff mxoy: = M(G)= 
n+1, And, they raised the following problems: 

(1) For a given graph G and integer t with m(G)<t=<M(G), does there exist 

an orientation G € Og so that k(B(G)) = 1? 
(2) What are the minimum and maximum number of edges in a graph with n 
vertices and given values of m and M? 

We given an affirmative answer to the first problem and lay some ground-work 

for the second problem, namely for fixed n and k, 1=<k <2n, we establish the 
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maximum number of edges possible in a graph H with n vertices so that 
k(B(H)) =k for some H € Oy, and the minimum number of edges possible in a 
graph G with n vertices so that k(B(G)) =k for some G € Og. 

It is easy to see that a bipartite graph H with bipartition {v,,...,v,} and 
{w,,..., W,} can be expressed as B(G) for some graph G if and only if H 
contains no edge v;w,; (1<i<n) and contains at most one of the edges in 
{u;w,, vjw,} for all 1<i<j<n. 

First, some additional notation and terminology will be described. If xy € 
E(G), then G—xy denotes the graph with vertex set V(G) and edge set 
E(G) — {xy}. If x #y belongs to V(G) and x is not adjacent to y, then G + xy 
will denote the graph with vertex set V(G) and edge set E(G)U {xy}. If 
xe V(G), then G — x denotes the graph with vertex set V(G) — {x} and edge set 
E(G) — {xy zy is adjacent to x}. Similar notations are used with G in place of G. 
It is well known that every tournament 7 can be canonically decomposed into 
(non-empty) strong components A,,...,A,,, for some m >1, so that V(A,)U 

--UV(A,,) is a partition of V(7T), and if m>1, then each vertex in A; 
dominates each vertex in A; iff 1<i<j<m. If m=1, T is said to be strongly 
connected (or strong). If m>1, we call A, the initial component and A,, the 
terminal component. The corresponding vertex sets in B(7) are denoted 
B,,...,B, and C,,..., C,. Acycle using every vertex of a strong component is 
called a spanning cycle of that component. If H is used to denote an oriented 
graph, then H will always denote its underlying (undirected) graph. An arc from 
x to y will be denoted x > y or xy, and we will say that x dominates y or that y is 
dominated by x. 


An affirmative answer is given to problem 1 in the following result. The proof is 
due to Thomassen [2]. 


Theorem 1. Let G be a graph and suppose that t is an integer satisfying 
m(G)<t<M(G). Then there exists G € Og so that k(B(G)) =t. 


Proof. The result is clear if |E(G)|=0. So assume that |E(G)|>0. First, note 
that if GeOg, if u,;— U2 is an arc in G, and if G* denotes (G -(u,—> 
uz)) + (Uz u,) (i.e. G* is the result of reversing the orientation of u,— u,), then 
|k(B(G*)) - k(B(G))| <1. To see this, note that B(G"*) = (B(G) - U1 Wz) + UW, 
(notation as in the introduction). Now, if v,w, is a cut-edge of B(G) and v, and 
w, are in the same component of B(G), then k(B(G*)) = k(B(G)) + 1. If v,w» is 
not a cut-edge of B(G) and v, and w, are in different components of B(G), then 
k(B(G*)) = k(B(G)) — 1. Otherwise, k(B(G*)) = k(B(G)); so, |k(B(G*))— 
k(B(G))| €1. 
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Let G' (respectively, G”) denote the orientation of G such that m(G)= 
k(B(G')) (respectively, M(G) =k(B(G’"))). G" can be obtained from G’ by 
successively reversing the orientation of single arcs, and the number of com- 
ponents in the successive resulting bipartite graphs starts with m(G) and ends 
with M(G). By the first remark, these numbers either increase by 1, decrease by 
1, or remain the same; that is, each number between m(G) and M(G) is 
assumed, so for some G € Og, k(B(G))=t. O 


3. 


Now, we turn to the issues concerning problem 2. First, we study k(B(7T)) fora 
tournament 7. The results are stronger than necessary, but the next two lemmas 
are included for their own interest. As an immediate consequence we obtain the 
values of m(K,) and M(K,,). The proof of Lemma 2 is straightforward and so is 
omitted. 


Lemma 2. Suppose that T is a nonstrong n-tournament, n = 3, with initial strong 
component A, and terminal strong component A,, for some k 22. Then 


1, if |A,| 23 and |A,| = 3, 
K(B(T)) =) 2, if (|Ai| =1 and |A,| = 3) or (\A,| =3 and |A,| = 1), 
3, if |Ai| =|A,| =1. 


Remark. It is easy to see that if 7 is the cyclic triple, then k(B(7T)) = 3. 


For strong n-tournaments, n > 4, we obtain the following result. Recall that if x 
is a vertex in a digraph D, then d7(x) (respectively, d5(x)) denotes the number 
of vertices in D dominated by x (respectively, dominating x). The subscript D will 
be deleted if the context is clear. 


Lemma 3. Let T be a strong n-tournament, n= 4, then 
2, if there exists arc yx in T such that d*(x)=d"(y)=n—-2 


se) 1, otherwise. 

Proof. First, suppose 7 contains an arc yx in T such that d*(x)=d7(y) =n —2. 
Suppose that a spanning cycle in T is given by u,u,---u,,u,. Without loss of 
generality we may assume that x =u, and y=u,. In B(T), v, is adjacent to w,, 
2<j<n-—1; each v,, 2<i<n—1, is adjacent to w,; and v, is adjacent to w,,,, 
2<i<n-—1. That is, v),...,Un—-1, W2,..., W, are in one component of B(T). 
And, edge v,, is the second component of B(7T). Note that the strong 
4-tournament is such a 7. 
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Now let 7 be an n-tournament, n2=5, containing no arc yx such that 
d*(x)=d™(y)=n-—2. We employ induction to prove that k(B(T))=1. For 
n=5, there are six strong n-tournaments, only four of which contain no arc yx as 
above (see [3, p. 92]); it is straightforward to check that k(B(7T)) = 1 in these four 
instances (e.g. B(T) is a 10-cycle if 7 is the regular S-tournament). So, suppose 
that n >6 and assume that the result holds for such strong tournaments of orders 
4 through n— 1. It is well known (e.g. see (3, p. 6]) that 7 contains a strong 
(n - 1). -tournament, say denoted W. Let a spanning cycle in W be given by 
UnU;**+U,U>. Let u, denote the vertex of T not in W. If k(B(W))=1, then 
clearly k(B(T)) = 1 since d*(y,) >0 and d-(u,)>0. If k(B(W)) #1, then by our 
induction hypothesis, W must contain an arc yx such that dj{x) = dj y) =n — 3. 
Without loss of generality we may assume that x =u, and y =u,, and moreover, 
by the remarks above, we see that the edge v,,w, (of B(W)) is a single component 
of B(W) and u2,..., Un-1, W3,-- +» W, are the vertices of the second component 
of B(W). Now, not both u,u, and u,u, are arcs of 7, as otherwise arc u,u, 
contradicts our assumption about 7. The three remaining possibilities are treated 
in turn: (i) u.u, and u,u, are arcs of 7, (ii) u,u2 and u,u, are arcs of T, (iii) uu, 
and u,,u, are arcs of 7. In case (i), since vw, and v,,, are edges in B(T) which 
connect the two components of B(W) and v, is adjacent to some w,, 2<j<n—1 
(since d*(u,)>0), we see that B(T) has one component. Case (ii) is similar to 
case (i) and utilizes the fact that d~(u,)>0. In case (iii), if usu, (respectively, 
u,u3) is an arc of 7, then v,w,u,W2v, (respectively, WU, W2U,,W,) is a 4-path in 
B(T) which connects the two components of B(W) with vertices v,, w,. That is, 
in case (iii), B(7) has one component. In any case, k(B(T))=1, and by 
induction the result follows. O 


Theorem 4, 
1, ifn=S, 
m(K,)= 42, ifn=1,4, 
3, ifn =2, 3, 
and 
, ifn=1, 
M(K.)={3 ifn >2. 
4. 


We now turn our attention to the following problem mentioned in Section 1 
after problem (2). Fix 2>1. For each k, m(K,)<k<2n, what is the least 
possible number f(n, k) of edges in a graph G with n vertices so that for some 
orientation G of G, k(B(G)) = k? We write m(K,,) because m(K,,) < k(B(G) ) for 
all orientations of all graphs with n vertices; however, as we showed above 
m(K,,) = 1 for n 25. It is easily seen that f(n, k)=2n ~k fori<n <4. 


Bipartite graphs 325 


Theorem 5. For n=1 and m(K,,) <k <2n, f(n, k) =2n —k. 
Proof. If G is a graph with 7 vertices and k(B(G)) =k for some G € Og, then 
|E(G)| = |E(B(G))| = S (IE(C)|:Ca component of B(G)} 
ay {(|V(C)| — 1:C a component of B(G)} 


=2n —k. So, f(n, k) 2=2n —k. 


To see that equality is possible a suitable graph is constructed. Suppose, first, that 
n is even, say n = 2m, for some m 23. For 0<j<4m —1 let G; be the oriented 


graph with vertex set u,, u2,..., U2, and exactly the first / arcs in the sequence 
UjUz, U3lz, Uzlg, Usa...) Urm—3am—2, Urm—1U2m—2, Urm—iUam, U3, 
U3Us,...) Urm—3am—1, Urm—i1ly, Ugly, Ugly, ..., Ur_—2Uom, UzmUz (the first 


2m — 1 terms describe an ‘antidirected’ spanning path, the next m terms describe 
a (directed) m-cycle, and the last m terms describe a (directed) m-cycle vertex 
disjoint from the previous one). Then it is easy to see that B(G,,,-,) Consists of a 
tree (actually a caterpillar) with 4m — k + 1 vertices together with k — 1 isolates. 
Thus, if G; denotes the underlying graph of G, 1<j<4m —1, then G,,,_, has 
n = 2m vertices, 2n — k = 4m — k edges, and k(B(Gsm_,)) =k. The case when n 
is odd is treated similarly. O 


5. 


A maximum version of the previous problem is the following problem: Fix 
n=1. For each k, m(K,,) <k <2n, what is the largest possible number F(n, k) of 
edges in a graph G with 7 vertices so that for some orientation G of G, 
k(B(G)) =k? It is straightforward to check that F(n, k)=f(n, k) for 1<n <3, 
m(K,,) <k <2n, and that 
f(4,k), ifS5<k<8 
f(4,k)4+1, if2<k<4, 

Moreover, for n =5, F(n, 1) = F(n, 2) = F(n, 3)= (3), by Theorems 1 and 4. A 
lower bound for F(n,k) is given next. For a real number r, we use {r} 


(respectively, [r]) to denote the least integer greater than or equal to r 
(respectively, the greatest integer less than or equal to r). 


F(4,k)= { 


Lemma 6. Letn =5 and 4<k<2n. Then 


ee ire Bera 
5) 2 - 2 , if4ske<n, 


(n-{=}) (n-[ 4], ifn+1<k<2n. 
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Proof. For n and k, n2=S and 4<k<2n, we define three vertex disjoint, 
oriented graphs A, B, C in order to construct an oriented graph G with n vertices 
such that k(B(G)) =k. If 4<k <n, let A be {(k — 1)/2} independent vertices, let 
B be a transitive (n — k + 1)-tournament, and let C be [(k — 1)/2] independent 
vertices. For 4<k <n, G, is constructed as follows: each vertex of A dominates 
each vertex in B and in C and each vertex of B dominates each vertex in C. Then 


Qian lew 


arcs, and B(G,) consists of one big component (with {(k — 1)/2} + 2(n —k +1) + 
{(k ~ 1)/2] =2n —k +1 vertices) and k —1 isolated vertices, i.e. k(B(G,)) =k. 
Now if n+1<k<2n, let A, B, and C be pairwise disjoint sets with |A|= 
n—((k—1)/2], |B}J=k-—n-—-1, and |C|=n—{(k—1)/2}, and form G, with 
vertex set AU BUC as follows: each vertex of A dominates each vertex in C. 
Then G> has (n — {(k — 1)/2})(n — [(k — 1)/2]) ares, and B(G>) consists of one 
big component (with |A|+ |C| =2n — k + 1 vertices) and |A| + 2|B|+|C| = 
isolated vertices, i.e. k(B(G,))=k. O 


In the remainder of this paper we show that equality actually holds in Lemma 
6. To do this we show (in Lemma 8) that if G is a graph with n vertices such that 
for some GeOg, B(G) has k components, fewer than k—1 of which ‘are 
isolates, then there is a graph H with V(G) = V(H) and |E(G)|<|E(H)| so that 
for some H € On, B(H) has k components and more isolates than does B(G). 
This implies that there is a graph G with n vertices, F(n,k) edges, and 
k(B(G)) =k for some G € Og, such that B(G) has k — 1 isolates (and exactly one 
non-trivial component if kK #2n). Then equality in Lemma 6 will follow from the 
next result. 


Lemma 7. Letn =5 and 4<k<2n., Let e(n, k) be the largest possible number of 
edges in a graph G with n vertices so that for some GEOg, B(G) has k 
components, k — 1 of which are isolates. Then 


5) >] 7 >| , if 4<k<n, 


e(n, k) = 2 2 


NG TEN, davies 


Proof. Clearly, if k = 2n, then e(n, k)=0, so assume that k <2n. Suppose that 
G is a graph with V(G) = {u,, u2,..., u,}. Recall that for an orientation G of 
G, our convention is to let those vertices of B(G) in V = {v,, v2,..., Uz} 
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(respectively, W={wW,, W2,..., Wp }). correspond to the rows (respectively, 
columns) of the adjacency writes of G. Suppose that G is a graph so that B(G) 
has k components, k — 1 of which are isolates, for some orientation G of G and 
that |E(B(G))| = e(n, k). Since we assume that k <2n, exactly one component C 
of B(G) is not an isolate. Let X and Y denote the sets of isolated vertices 
contained in V and in W, respectively. And, for vertex w in B(G), let N(w) 
denote the sets of vertices adjacent to w in B(G). 

Suppose that 4<k <n. Since the number of isolates is kK —1<n, C contains 
vertices uv, and w, for some t, 1<t<n. If for some i, 1<i<n, v;€ X and w,€ Y, 
then B(G) — {w,v:u € N(w,)} + {wu su € N(w,), v =u;} has k components, k — 1 
of which are isolates, and more edges than B(G), a contradiction. Hence we may 
assume that for all 1, 1<i<n, if v;¢X, then w,¢Y; and that if w,e Y, then 
v;¢X. Thus, |X|+|Y|=—1, and 


cayenne (3)-(4)-() (3 


(Pe) ((F 


To see that equality is possible, see the construction of G, in the proof of Lemma 


6. 
Suppose that n+ 1<k<2n. Since n+1<k, there exists t, 1<t<n, with 


v,eX and w,€ Y. If, for some i, 1<i<n, both v; and w, are vertices in C, then 
B(G) — {wv:u € N(w;)} + {w,u:u e N(w;)} 

has at least |E(B(G))| edges and k components, k —1 of which are isolates. 
Hence, we may assume that if v,;¢ V(C), then w; ¢ V(C), and that if w;e V(C), 
then v; ¢ V(C). Let Z = {i:u, € X, w, € Y}. Since (|X| — |Z]) +|Z| + (/Y¥| -|Z]) = 
n and |X|+|¥|=k-—1, |Z|=k-—1-—n. So, (|X| —|Z|)+(¥|-|Z|)=2n —k+ 
1. Hence, 

ed ed 


e(n, k)=|B(B(G))|< XI -IZI¥1 - IZ) = {= ; 


To see that equality is possible, see the construction of G, in the proof of Lemma 
6. 
The result follows. O 


The reduction procedure referred to prior to Lemma 9 is established next. 
Lemma 8. Suppose that n and k are integers, 4<k <2n. If G is a graph with n 


vertices such that for some G € Oc, B(G) has k components, fewer than k —1 of 
which are isolated, then there exists a graph H with n vertices such that for some 
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H€O,, B(A) also has k components, more isolates than B(G), and at least as 
many edges as B(G). 


Proof. Let V(G) = {u,, u2,...,u,}, let C be a nonisolate of B(G), let 5c be its 
characteristic function, and for O< a, B<1 define Nag= {j:1<j <n, d¢(v;) = 
a, 5-(w,) = B}. Of course, |NioU Nii] 21 and |No,. UMu| 21, as C is a non- 
isolate. 

First suppose that Nj, #@, say sE Ni. If {u,:2€ No} U (wj:j € Nio} consists 
entirely of |Noi| + |Nio| isolates, then all other non-isolate components of B(G) 
have vertex sets entirely in {v,:i € Noo} U {w;:/ € Noo}. In such a case suppose 
that v; is a vertex of another non-isolate component D#C, where i € Ny. 


Suppose vu; is adjacent to exactly w;,,...,w, for some r>0O and ji,...,j,€ 
Noo — {i}. For each j € Nio UN, none of v,,,..., v,, is adjacent to w;. Delete the 
r edges u,w,,,..., U,W,, in B(G) and add the |MioU N,,| -7 edges (uu Yi? JEN pV 


Ni, 1</ <r} to the resulting bipartite grap to obtain a suitable B(H) as in the 
statement of the lemma (v; is an isolate in B(H), V(C)U V(D) ~ {uj} are the 
vertices of one component in B(A), and otherwise components of B(H) are as in 
B(G)). So, let us assume that {v;:i € Noi} U {w,:/ € Nip} does not consist entirely 
of isolates, say, without loss of generality, that w, is in some non-isolate 
component D#C, where j € Ni. Say w; is adjacent in B(G) to p vertices in 
{u;:1€ Noo}, say U,,, Viz,---,U;,, and q vertices in {v, 7} € No}, say 
Uj, Uj» +++» Yj, Recalling that s€N,,, v, may be adjacent in B(G) to some of 
Wires Wj If g >0, choose notation so that for suitable gq, =0 and q,=0 with 
q =41 +42, U, is adjacent to each vertex in {w;:j € {j1, j2,-- +, J4,}} and v, is not 
adjacent to each vertex in {w):j € As ie elags = lat h= am B(G) delete edges 
Uj;,Wj, -. +5 U;,W,, add edges u,,W,,...,U,,Ws, for each We {jgyei +++ >Jqg} add 
edge u;w,, delete edges u,,w,,..., A w, and add edges u,w,,,..., u;w,,, and for 
each j € {j,,...,j4,} delete edge v,w w, and add edge vu; Ws to obtain B(H). Then 
w; is an isolate of B(H), V(C)U V(D) ~ {w i} Of B(A) is the vertex set of a 
component, and all other components are as in B(G). Thus, B(H) is a suitable 
bipartite graph. Note that this construction is valid if either p=0O or q=0 
(however, p + q >0). This completes the case in which Ni, #90. 

If any component C of B(G) satisfies N,,#, then proceed as above. So, 
suppose that for every component C of B(G), Ni, =. Let C be a non-isolate 
component of B(G). If {v;:i € Noi} U {w,:j € Nio} consists entirely of |No,| + |Miol 
isolates, then any other non-isolate component of B(G) has vertex set entirely in 
{v; 11 € Noo} U {w;:j € Noo}. In such a case, suppose that v;, is a vertex of another 
non-isolate component D#C, where v;,€ Noo. Suppose that in B(G), v,, is 


adjacent to exactly w;,,...,,,, for some r>0 and j,,...,j, € Noo — {io}. Note 
that u,, u),,..., u;, are independent in G and G since ie€ Nip and ji,..- 5), € Noo 
and w,; is an isolate in B(G). In B(G) delete edges v,,w, Vir-+ +> U;,W;, and for each 
i€ Ny, add the r edges u,w;,,..., u;W,,, to obtain B(H). Then v,, is an isolate in 


B(H), V(C)U V(D) — {v,,} is the vertex set of a non-isolate component in B(A), 
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and otherwise components of B(H) are as in B(G). That is, B(H) is a suitable 
bipartite graph. So, let us assume that {v;:i € No,} U {w;:j € Nio} does not consist 
entirely of isolates, say, without loss of generality, that w; is in some non-isolate 
component D#C, where j € Nig. Say w, is adjacent in B(G) to p vertices in 


{u;:i€ Noo}, Say U;,,---,U;,, and q vertices in {v;:/ € No}; Say Uj,,..-, UY, 
Now, by the assumption made about components in B(G) at the outset of this 
paragraph, none of v;,,..., ¥,, is adjacent to any of w,,,..., Wi, Moreover, v; is 
not adjacent to any of w,,,..., w;, (since w; is adjacent to v,,,..., ¥j,). But y, is 
in C, so there exists jy in No, so that v, is adjacent to w,, (so w;, is in C) and 
Jo#hi,+--+Jg- And u;,,...,Uj;,,uj,, are independent in G and G, since 
Uj,,.-++, Uj, are in D and w,, is in C. If g>0, then in B(G) delete the p edges 
Ui, Wy. +» UiW;, add the p edges WiWiy +s UW), delete the q edges 
Uj,Wj,--., U;,W; and add the q edges v,w,,...,u,w,, and add the q edges 


i Wij» , to obtain B(H). Now, for q>0, in B(H) w, is an isolate, 
V(C)U V(D) — {w;} is the vertex set of a component, and otherwise components 
are as in B(G). Thus, B(H) is as desired, provided q >0. Now, suppose that 
q=09, so that p>O (as w, is in a non-isolate component D). Recall that 


Uj,Wigr + +» Uj Wy 


V(C)UV(D) contains none of w,,...,™;,, (by our assumption concerning 
Ni, =9). In B(G) delete the p edges v;,,w,,...,u;,w;, and for each le 
{i,,...,4,} proceed as follows: if either v,, is not adjacent to w, or v;,w, is an 


edge whose removal doesn’t disconnect u,, from w,, then add edge v,w,, (and 
delete edge v,,w, if present); if v;,w, is an edge whose removal does disconnect u,, 
from w,, then delete v,,w, and add edges v,w,, and v,w,. In the latter case that 
part of the component of B(G) which contains w, when v,,w, is deleted is 
incorporated with C into a new component by addition of edge u;w,. This 
construction yields a B(H ) for a suitable H as desired. 

In any case, a suitable H is obtained from G. O 


Combining Lemma 6, 7 and 8, we obtain our last result. 


Theorem 9. Letn =5 and 4<k<2n. Then 


are eral 

7 2 - 2 , if 4sksn, 
2 2 

F(n, k)= 7 


ESOS). ra trctem 


Proof. Let G be a graph with F(n, k) edges and orientation G so that B(G) has k 
components. If fewer than k — 1 components of B(G) are isolates, then by use of 
Lemma 8, repeatedly if necessary, we may suppose that we can find a graph H 
with some orientation H so that B(H) has k components, k — 1 of which are 
isolates, and |E(G)|<|E(H)|. By Lemma 8, |E(H)|<e(n, k). Combining this 
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with Lemma 6 we have 
e(n, k) = F(n, k) = |E(G)| S$ |E(H)| Se(n, k), Le. 
F(n, k)=e(n,k). O 
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Let C, denote the cycle of length n, n =3. A C,-factor of a graph G is a spanning subgraph 
of G in which every component is isomorphic to C,. A graph G is C,-factorable if it is the 


union of edge-disjoint C,-factors. A spanning subgraph of G is a {C,, C,,... , C,}-factor of G 
if each of its components is isomorphic to one of the cycles C,, C,,...,C,. G is a 
{C,, Cy, ..., C,}-factorable if it is the union of edge-disjoint {C,, C,,..., C,}-factors. In 


this paper we present a sufficient condition for a group graph to have a C,-factor and a 
sufficient condition for a group graph to be C,-factorable. We also investigate 
{C,, C,,..., C,}-factorability in such graphs. 


1. Introduction 


A spanning subgraph F of a graph G is an {A, B,..., P}-factor of G if each 
component of F is isomorphic to one of the graphs A, B,..., P. In particular, if 
there exists a subgraph H of G such that every component of F is isomorphic to 
H, then F is said to be an H-factor of G. G is {A, B,... , P}-factorable if it is the 
union of edge-disjoint {A, B,..., K}-factors and the {A, B,..., P}-factors are 
said to comprise an {A, B,..., P}-factorization of G. G is H-factorable if it is 
the union of edge-disjoint H-factors and the H-factors are said to comprise an 
H-factorization of G. One of the np-problems mentioned by Akiyama and Kano 
in [1] is that of determining whether a given graph has a C,-factor, where C, 
denotes the cycle of length n, n >3. We study this problem for the special case 
where G is a group graph. In this paper, we present a sufficient condition for a 
group graph to have a C,,-factor and a sufficient condition for a group graph to be 
C,-factorable. We also investigate {C,, C,,..., C,}-factorability in such graphs. 

A graph (G, R,) with vertex set G and edge set R, is a group graph if Gis a 
group, A is a subset of G, and for a, b€G, (a, b) eR, if and only if a"'beE A. 
Since we wish to confine ourselves to finite, simple, connected and undirected 
group graphs in this paper, we require that G is finite, e¢A where e is the 
identity of G, (A)=G where (A) denotes the set of all finite products of 
elements of A, anda ¢ A >a™'e€A, respectively. A discussion of the equivalence 
of these conditions to the given properties can be found in the textbook [2] by 
Teh and Shee. 
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2. C,,-Factors 


Theorem 2.1. /f there exists se A such that o(s) =n =3, where o(s) denotes the 
order of s in G, then (G, R,) has a C,,-factor. 


Proof. Let se A with o(s)=n and let H, denote the subgroup of G generated 
by s. 


H, = {e, s,s*,...,5" '}, 
where e is the identity of G. Let 
G =aj,ll, Vail, VUanH, U:--Ua,H, 


be the coset decomposition of G by H,, where ay =e and k 0. The cosets in the 
decomposition are disjoint and each vertex of (G, R,4) belongs to exactly one 
coset. 

To prove the theorem, it is sufficient to show that for any i, 0<i<k, there 
exists a cycle of length n which spans the vertices of (G, R,) belonging to a;H,; 
and furthermore, that if i#j, then the cycle spanning the vertices in a,H, is 
disjoint from the cycle spanning the vertices in a,H,. 

Consider the subgraph of (G, R,4) with vertex set 


a;H, = {a,, a;s, as’, ..., as"~'}, 
0<i<k, and edge set 

{(@,, a5), (@;S, a;s”),..., (a,s"~', a,)}. 
Clearly, this is a cycle of length n in (G, Ry). If i#j, a,H,a,H, =@, hence the 
cycles 

(a;, a8, as”, ... , as 
and 

(a;, 4,5, ajs°,...,a)s"~", aj) 


are disjoint. Thus the collection of cycles of length n obtained from a;H,, 
0<i<k, comprises a C,-factor of (G, Ry). O 


This theorem implies that (G, R,) has a C,-factor for any n, n 23, which is the 
order of some element of A. Note that an element of order two in A generates a 
P,-factor of (G, R,), where P, is the path with two vertices. In the literature, a 
P,-factor is also Known as a 1-factor, i.e. a factor in which all vertices have degree 
one. 


Example 2.1. The dihedral group D,=(a,b:a"=e, b*=e, (ab)’=e) can 
always be used to produce a group graph with a C,,-factor, for any n > 3. In fact 
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the group graph generated by D, is {P), C,}-factorable. With n=6 and 
A = {a, b, a°}, we obtain the group graph (D,, R4) shown in Fig. 1. 


3. {C., Cy, ++«, C,}-Factorability 


Theorem 3.1. /f a,b,...,p are the distinct orders of the elements of A and 
a,b,...,p 23, then (G, Ry) is {C,, Cy, ... , C,}-factorable. 


We first introduce some notation and prove lemmas which will simplify the 
proof of this theorem. If s€ A with o(s) =n 3, let C),, denote the C,,-factor of 
(G, R,) generated by s. Let s~' denote the inverse of s in G. 


Lemma 3.1. H, = H,-1 and Cis} a Cis- 

Proof. Since s-'=s""' and -i=n-i (modn), we have H,=H,-: and 
(a;, ajs, a;s”, eeey a;s"—', a;) = (a;, a;s~”, eeey as"), a;) for any a; € G. Hence 
the lemma follows. O 
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Lemma 3.2. /f C,,; and C,, are not edge-disjoint, then t=s or t=s', in 
particular o(s) = o(t). 


Proof. Let 


n-1 


k 
C= U (a, ajs, ajs’,..., a,s""', a;) 
prs 


and 
1 
Ci = U (6,, bjt, bt?,... , Bt", By). 
p=0 


Suppose C),; and C;,,; are not edge-disjoint, then there exist i, j, q, r with 
O<i<k, 0<j<l, 0<q<n-1, 0<r<m~—1 such that 

(a;s7, a,s?*') = (bt’, bt’*"). 
This implies that either 


(a) as?=bt) and as?*'=5,t'*! 
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or 
r+ v 
(b) as? =5,t'*" and as?*! =’. 


From case (a), we have a,s?*' = (b,t’)t = (a,s?)t or s = t. From case (b), we obtain 
ajs? = (byt’)t = (ajs?*")t or e=st. Hencet=s"'. O 


We can now return to Theorem 3.1. 


Proof of Theorem 3.1. By Theorem 2.1, each s€A generates a C,,,)-factor of 
(G, R,). By assumption, the components of this factor are isomorphic to one of 
the cycles C,,C,,...,C,. By Lemmas 3.1 and 3.2, either these factors are 
disjoint or they are equal. Hence, to prove that (G, R,) is {C,,C,,..., C,}- 
factorable, it remains to show that every edge of (G, R,) belongs to a C,,,)-factor 
of (G, R,) for some t EA. 

Let (x, y)€R,, then x~'ye€A which implies that x~'y =¢ for some teA. 
Hence y = xt and (x, y) = (x, xt). Since C,,, is a factor of (G, R,4) and x eG, then 
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x is a vertex of C;,. Thus x = 5,t’ for some j and r, O<j</, O0<r<m-—1 and 
xt =b,t’*'. Finally, (x, y) = (b)t’, bjt’*') which is an edge of Cy, O 


Example 3.1. In the group G = (s, t:s°=e, sts~'=27), o(s)=3 and it can be 
shown that o(t)=7. Using A = ({s, t, 5”, 1°}, we obtain the {C;, C,}-factorable 
graph (G, R,) shown in Fig. 2. 


If some element of H has order two, then (G, Ry) is {P,, C,, C,,..., C,}- 
factorable, where a, b,..., p are the distinct orders of the other elements of A. 

We obtain a sufficient condition for (G, R,) to be C,,-factorable as a corollary 
to Theorem 3.1. 


Corollary 3.1. [f for every s € A, o(s) =n 2 3, then (G, R,) is C,-factorable. 


Since we consider only connected group graphs, this corollary actually requires 
that every generator of the group G be of order n. Note that if every generator of 
A is of order two, then (G, R,) is P,-factorable, or equivalently, is 1-factorable. 


Example 3.2. The group G = (a, b: a*=b*=e, ab =ba) with A = {a, b, a’, b*} 
generates the C,-factorable group graph shown in Fig. 3. 
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First, Cogis and Habib (RAIRO Inform. Théor. 13 (1979), 3-18) solved the jump number 
problem for series-parallel partially ordered sets (posets) by applying the greedy algorithm and 
then Rival (Proc. Amer. Math. Soc. 89 (1983), 387-394) extended their result to N-free posets. 
The author (Order 1 (1984), 7-19) provided an interpretation of the latter result in the terms of 
arc diagrams of posets explaining partly tractability of this special case. 

In this paper, we present an algorithm for solving the jump number problem on arbitrary 
posets which extends the author’s approach applied to N-free posets and makes use of two new 
types of greedy chains in posets introduced in companion papers [8, 9]. Complexity analysis of 
the algorithm supports our expectation that, going from N-free to arbitrary posets, thc 
complexity of the problem increases with the number of dummy arcs required in their arc 
diagrams. The algorithm works in time which is linear in the poset size but factorial in the 
number of dummies, therefore it is a polynomial-time algorithm for posets with bounded 
number of dummies in their arc diagrams. 


1. Preliminaries 


A partially ordered set (P, <) is in this paper simply written as P and called a 
poset. We assume also that all sets are finite. A linear extension of P is a total 
ordering L=p,p2-::p, Of P preserving the relation, that is if p; <p, in P then 
i<j. A pair (p;, pi+,) is a jump of L if p; € p;,, in P. The jumps partition L into 
chains C; of P, so we can write L=Cy+ C,+---+C,. The jump number s(P) of 
P is equal to minimum &k taken over all linear extensions L of P. A linear 
extension L of P is optimal if it has s(P) jumps. The jump number problem 
consists in evaluating s(P) and constructing an optimal linear extension. Although 
the problem is NP-complete (see Pulleyblank [4]), there exist polynomial-time 
algorithms for some special classes of posets defined by forbidden substructures 
(see Rival [5] and also [6]) or restricted by bounding some of their parameters 
(see [1] and [2]). The aim of this paper is to provide yet another algorithm of the 
latter type which generates special linear extensions introduced in [8]. 

We now define some basic graph-theoretic notions since we shall make a 
significant use of digraph representations of posets. A digraph may contain loops 
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and multiple arcs, so it is defined as D =(V, A, t, 4), where V is the vertex set, A 
is the arc set, and ¢, A (t for tail and h for head) are two incidence mappings 
t,h: A—V. An arc aeéA is of the form a=(t(a), h(a)). A sequence of arcs 
= (a), @,...,@,) is a path of length / if h(a,;) =¢(a,,,) for i=1,2,...,1-1. 
The path a begins with the arc a, and also with the vertex ¢(a,). Similarly, 2 
terminates with a, and with hA(a,). Therefore, we can write t(7)=t(a,) and 
h(x) = h(a,). A digraph is acyclic if it contains no path of length greater than 1 
and such that A(a,)=1t(a,). The transitive closure of D is denoted by tc D= 
(V, tc A, t*, h*), where (a,, a,,...,a,), 1/21, is a path in D if and only if tcA 
contains an arc 6 such that ¢*(a,)=¢*(b) and h*(a,;)=h*(b). Let us denote 

*=1tc(A)U {(u, v):veE C}. 

Posets can be represented by graphs and digraphs. In this paper we make a 
significant use of digraph representations (called arc diagrams) in which the poset 
elements are assigned to arcs and the relation is preserved along the paths of the 
digraphs. Formally, an arc diagram of a poset P is an acyclic digraph 
D4(P)=(V, R,t,h) without loops (but possibly with parallel arcs) and a 
mapping @: P— R such that for every p, q €« P, p#q we have 


p <q in P iff (h*(o(p)), t*(9(q)))€ R*, 


where ¢*, h* are the incidence mappings of tc D4(P) and R*=tc(R)U 
{(v, u):u eV}. For the sake of simplicity, we shall denote an arc diagram by a 
digraph D“(P) in which certain arcs (drawn in solid lines) are labelled by the 
poset elements in the way which preserves the relation between them along the 
paths in D4(P). Let us denote §S = R — o(P). An arc ae @(P) is a poset arc and 
otherwise a is called a dummy arc (and drawn in dotted lines). A path a in D4(P) 
is a poset path if it consists entirely of poset arcs. For the sake of completeness we 
define also the vertex diagram (known also as a Hasse diagram) of P to be a 
digraph D”(P) =(P, A) in which (p, q)€ A if and only if q covers p in P. (Note 
that, since vertex diagrams contain no parallel arcs, we can drop out incidence 
mappings from their definition.) Figure 1 shows vertex and arc diagrams of some 
sample posets. 

Arc diagrams of posets are best know in network analysis as event or PERT 
networks, where they are used mainly for time analysis of projects whose 
precedence relations are represented by activity-one-arc networks. In poset 
theory however, arc diagrams have not been widely accepted as poset repre- 
sentations. The main reason behind this is that, in contrast to the uniqueness of 
vertex diagrams, a poset may have an infinite number of arc diagrams and in fact 
there is no standard one. Moreover, finding an arc diagram with the minimum 
number of dummies (which may be blamed for non-popularity of arc diagrams) is 
an NP-complete problem in general. 

Recently, it was demonstrated by Mohring [3] that certain hard combinatorial 
problems can be efficiently solved on comparability graphs by using arc diagrams 
of corresponding posets. For the jump number problem, arc diagrams have been 
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Fig. 1. Vertex and arc diagrams of some posets. 


used in [6] to simplify the solution and derive the number of jumps in N-free 
posets. (N-free posets have arc diagrams with no dummy arcs, see Section 4 in 
[7].) Then, this approach has been also used in [7] to attack the problem for 
arbitrary posets. Finally in [8], a new proper subfamily of linear extensions has 
been defined on an arc diagram of a poset that contains an optimal solution and 
significantly reduces the search space. The result of [8] have been reformulated 
and improved in [9], where considerations are carried on directly on posets. 

The purpose of this paper is to present an implementation of the results of [8] 
and [9], and discuss its complexity. 


2. Greedy paths and linear extensions 


We assume in the sequel that a poset P is represented by an arc diagram D“(P) 
which: 

(i) has exactly one source and exactly one sink, and 

(ii) for every dummy arc a, indeg(h(a)) > 1 and outdeg(t(a)) > 1. 
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Fig. 2. Poset and its two compact arc representations. 


An arc diagram D“(P) which satisfies conditions (i) and (ii) is called compact. 
Note that we weakened the conditions imposed on arc diagrams to be compact 
formulated in [8]. For instance, both arc diagrams shown in Fig. 2 are compact, 
whereas the first one is not compact in the sense defined in [8] since dummy arc x 
can be contracted. 

An arbitrary arc diagram of a poset can be easily transformed to its compact 
form. Moreover, an arc diagram which has minimum number of vertices is 
compact and can be produced for every poset by a polynomial-time algorithm, we 
refer the reader to Section 4 of [7] for details. 

A chain C of a poset P is greedy if there are no elements pe P— C and qgeC 
such that p <q and moreover for no r which covers sup C in P, the chain C U {r} 
has this property. A linear extension L= Co+C,+---+C, of P is greedy if C; is 
a greedy chain in P — \j,<, C;. It is easy to see that every poset has an optimal 
linear extension which is greedy. It was proved in [8] that for every poset the class 
of greedy linear extensions can be further restricted to the class of semi-strongly 
greedy ones which contains an optimal one. The purpose of this paper is to 
provide an efficient algorithm for generating an optimal semi-strongly greedy 
linear extension of a poset. 

A semi-strongly greedy linear extension consists of two types of greedy chains 
which are naturally defined in the terms of arc diagrams. Let D4(P)=(V, R, t, h) 
be a compact arc diagram of a poset P. A path m=(x,,X2,...,X,), k 21 of 
D*“(P) is called a greedy path if it has the following properties: 

1. no vertex of except A(x,) is a head of any arc ye R, y #x; for every j, 

0<j<k; 

2. X, is a poset arc; and 

3. m cannot be extended to a path which satisfies 1 and 2, and has more poset 

arcs than z has. 

Let C,, denote the sequence of poset arcs of the path z. It is easy to see that a 
greedy path a in a compact arc diagram D“(P) of P contains no dummy arcs. 
Moreover, the corresponding C,, is a greedy chain in P. On the other hand, every 
greedy chain C of P generates a greedy path in a compact arc diagram 
D4(P)-let 2(C) denote the corresponding path. Thus, we have a one-to-one 
correspondence between greedy chains of a poset and greedy paths in its compact 
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arc diagram. Therefore, we can use greedy paths and greedy chains inter- 
changably. 

A poset is N-free if its vertex diagram contains no induced subgraph isomorphic 
to the vertex digram N -see Fig. 1(a). N-free posets are precisely those which 
admit arc diagrams with no dummy arcs (see [6]). In what follows we assume that 
an N-free poset is represented by such a diagram. 

If a poset P is N-free then every greedy linear extensions L of P is optimal 
(Rival [5]). In the other words, an optimal linear extension of an N-free poset P 
may begin with an arbitrary greedy path of D“(P). We now identify some greedy 
chains in arbitrary posets which share this property. 

A greedy path z in a compact arc diagram D“(P) of a poset P is called strongly 
greedy if, additionally to conditions 1-3, z satisfies: 

4. either (a) A(z) is the sink of D4(P), 

or (b) A(z) is the head of a poset arc b (6 #a,) such that every path 
terminating with b has no vertex which is incident with a 
dummy arc. 

In particular, 7 is strongly greedy if 7 is greedy and A(z) terminates a greedy 
path 2’ (2' #2) whose no vertex is a tail of a dummy arc. The diagram in Fig. 
1(c) shows that not every poset has a strongly greedy path. The main property of 
strongly greedy paths is described in the following theorem. 


Theorem 1 [8]. /f D4(P) admits a strongly greedy path x then there exists an 
optimal linear extension of P which begins with C,. 


We proved in fact that for a strongly greedy path a in D“(P), every greedy 
linear extension L of P can be transformed to another linear extension L* of P 
which begins with C, and L* has no more jumps than L. The examples in [8] 
show that such a transformation may result in decreasing the number of jumps in 
linear extensions. Since every N-free poset P admits a strongly greedy path a and 
a subposet P — C,, of P is also N-free, Theorem 1 provides another proof of the 
formula for the jump number of N-free posets, see [6] and [8]. 

If D4(P) contains no strongly greedy paths then D“(P) contains greedy paths 
which in a certain sense are better than the other. Note that if a is a greedy path 
and 7 contains a vertex u which is a tail of at least one dummy arc but not a head 
of any dummy arc then taking C, to a linear extension of P results in removing 
from D“(P) (together with 2) all dummy arcs incident with v. More precisely, a 
greedy path z is called semi-strongly greedy if: 

4’, « contains a vertex which is a tail of a dummy arc but not a head of any 

dummy arc. 

For instance the path (5, d) in Fig. 1(a) and the paths (p,), (p2) and (ps) in 
Fig. 1(c) are semi-strongly greedy. Note that all strongly greedy paths in diagrams 
of Figs. 1(a) and (b) are also semi-strongly greedy. An arc diagram may not 
contain a semi-strongly greedy path, e.g. when it has no dummy arcs. It is 
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however easy to show that every arc diagram which has no strongly greedy paths 
contains a semi-strongly greedy path. A formal proof of this fact given in [8] is 
based on elementary properties of acyclic digraphs formed by dummy arcs in 
compact arc diagrams. 

A counterpart of Theorem 1 for semi-strongly greedy paths has a weaker form 
and says that in the absence of strongly greedy paths in D“(P), a good candidate 
to begin an optimal linear extension is one of the semi-strongly greedy paths in 
D4(P). Formally we have 


Theorem 2 [8]. /f an arc diagram D“(P) of P contains no strongly greedy paths 
then P has an optimal linear extension which begins with a semi-strongly greedy 
path. 


Theorems 1 and 2 guarantee that every poset has an optimal linear extension 
L=CotC,+--++C,, herafter called semi-strongly greedy such that each chain 
C, is strongly greedy in P, = P —\,<;C; or semi-strongly greedy in P, if P, has no 
strongly greedy chains. In the next section we present an algorithm which finds an 
optimal linear extension of a poset among its semi-strongly greedy ones. 

We conclude this section with a lower bound to the jump number which is also 
suggested by arc diagrams. 


Observation 1. [f D4(P)=(V, R, t, h) is an arc diagram of a poset P then 
5. max{0, indexp(v) — 1} <s(P), (1) 


veV 


where indegp(v) is the number of poset arcs coming into v. 


Proof. Note first that if indegp(v)>1 (veV) then indegp(v) cannot be 
decreased by any greedy path which does not terminate at v. On the other hand, 
the removal of a greedy path from D“(P) that does not contain v results in a 
compact arc diagram in which indegp(v) is unchanged. Hence, only a greedy path 
terminating at v may reduce indegp(v) and each such a path contributes one to 
the jump number of P. This proves the bound. O 


The bound (1) can be further improved by adding 1 to the left hand side if, 
after removing strongly greedy paths from D“(P), the reduced diagram admits 
only semi-strongly greedy paths a whose terminal vertices A(z) are of total 
indegree 1 (in this case, the only other arcs incident with A(7) are dummies which 
go out of A(sz)). 


3. An algorithm 


A search method for finding for a poset a semi-strongly greedy linear extension 
with the minimum number of jumps is implemented in procedure OPTLINEXT. 
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A poset P is given by its compact arc diagram D = (V, R, t, h). Such a diagram can 
be produced by a polynomial-time algorithm which constructs arc diagrams with 
minimum number of vertices, see Section 4 of [7] for a survey of construction 
methods of arc diagrams. An arbitrary arc diagram can be easily transformed to a 
compact form by contracting dummy arc a for which indeg(h(a))=1. Such a 
transformation can be applied in the algorithm whenever a current diagram 
(which is compact) is reduced by the removal of a (strongly or semi-strongly) 
greedy path. The diagram is represented by adjacency lists of arcs coming to and 
going out of each vertex. It is convenient to partition each list into two sublists of 
poset and dummy arcs, respectively. 

Greedy paths (strongly and semi-strongly) in D can be easily identified by 
inspecting the vertices of D in a topological order, that is, when visiting vertex uv 
we can be sure that all its predecessors have been already visited. To keep track 
of greedy paths, we define two queues S and W which store strongly and 
semi-stongly greedy paths in the current diagram. The paths in S and W are 
uniquely represented by their end-arcs. Statement 1 in the procedure is assumed 
to find all these quantities in the given arc diagram and then procedure 
REMOVE takes care of updating S and W. The number of jumps in a linear 
extension L of P is denoted by S(L, P). 

procedure OPTLINEXT(arc diagram: D, linear extension: var L*); 

{The procedure finds an optimal semi-strongly greedy linear extension L* of 
a poset p represented by its arc diagram D.} 

procedure REMOVE(a, var D, S, W); 

{This procedure removes greedy path a from arc diagram D, transforms 
D—« into a compact form and updates queues S and W of strongly and 
semi-strongly greedy paths in the current arc diagram. Implementation 
details of this procedure are left to the reader. } 

procedure SUBLINEXT(z, L, D, S, W); 

{The procedure extends L with a semi-strongly greedy path 2 removes 2 
from D, extends L with strongly greedy paths which may result and then 
attempts to repeat this process. } 

begin 

augment L with z; 
REMOVE(z, D, S, W); 
while S #9 do begin 
5<S; augment L with 6, REMOVE(6, D, S, W) end; 
if D #9 then 
for every p in W do 
SUBLINEXT(p, L, D, S, W) 
else if s(L, P) <r then begin r —s(L, P); L*<—L end 
end {SUBLINEXT}; 
begin 
L<—S<—W <9; {L, S, W are handled as queues} r <=; 
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1. for ve V(D) do {vertices in V(D) are in topological ordering} 
update S and W; 
2. while S #9 do begin 
5<S; augment L with 6; REMOVE(4, D, S, W) end; 
3. if D contains no arc then begin r<—s(L, P); L*<—L end 
else for every 7 in W do 
SUBLINEXT(z, L, D, S, w) 
end. 


Theorem 1 guarantees that every strongly greedy path of D can be removed 
from D and added to L as a part of an optimal linear extension. Statement 2 of 
OPTLINEXT takes care of this step. The removal of a greedy path a from an arc 
diagram D is done by procedure REMOVE which, if necessary, transforms also 
D — & into a compact form and updates queues S and W. We leave implementa- 
tion details of the procedure REMOVE to the reader. 

The removal of strongly greedy paths from D may lead to exhausting all poset 
elements (arcs) of the diagram and this can happen not only for N-free posets. 
Otherwise, procedure OPTLINEXT attempts to build an optimal linear extension 
starting with an arbitrary semi-strongly greedy path a of D. This is done by the 
call of procedure SUBLINEXT(z, L, D, S, W) which: augments partial exten- 
sion L with 2, removes a from D and repeats recursively this process. To this 
end, procedure REMOVE is called, L is extended with strongly greedy paths of 
the reduced diagram and finally, if the current diagram D is non-empty, 
procedure SUBLINEXT is called again. 

Regarding the time complexity of the algorithm note that each sequence of 
recursive calls of SUBLINEXT is successful in the sense that a semi-strongly 
greedy linear extension of the poset is obtained. Hence, it is easy to see that the 
time spent by procedure OPTLINEXT on producing one complete semi-strongly 
greedy linear extension of the given poset is bounded by O(n + k), since one has 
only to reduce the diagram to the empty set by removing greedy paths. A poset 
with k dummy arcs can have at most k semi-strongly greedy paths, and the 
removal of a semi-strongly greedy path reduces the number of dummy arcs by at 
least one. Hence, there are at most k! semi-strongly greedy linear extensions 
produced by the procedure and thus, total time complexity of the algorithm is 
bounded by O(k! (2 + k)). This estimation is very rough since it assumes that no 
dummy arc is reduced from the diagram by strongly greedy paths and that exactly 
one dummy arc is involved with a semi-strongly greedy path. 

The performance of the algorithm implemented in OPTLINEXT can be 
improved at least in two directions. First, it is not necessary to go forward if a 
partial linear extension L has already the same number of chains as L*, the best 
one found so far. Secondly, the algorithm may be terminated when a linear 
extension is found which has the number of jumps equal to the lower bound (1). 

As an illustration, let us apply the algorithm to the arc diagram D of the 
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Fig. 3. Illustration of the algorithm. 


3-crown which is hown in Fig. 1(c). D has no strongly greedy path and contains 
three semi-strongly greedy paths (p,), (p2), and (p3). The diagram is symmetric 
with respect to each of these paths, what is not however recognized by the 
algorithm. We first choose the path (p,). Fig. 3(b) shows the compact arc 
diagram D, of the poset reduced by removing (p,). Note that this reduction 
results in removing four dummy arcs from the diagram. D, contains two strongly 
greedy paths (p2, q,) and (p3, q2) since q, and q2 are maximal. Removing any of 
them we obtain a diagram which contains no dummy arcs. Therefore, the 
algorithm will produce only 3 complete linear extensions (instead of 6!), all with 
the same number of jumps. Note that in this case left-hand side of (1) can be 
increased by 1 since all semi-strongly greedy paths of the diagram terminate at 
vertices of indegree 1. Therefore, the algorithm may terminate after producing 
the first linear extension. 


4. Conclusions 


We have presented the algorithm for solving the jump number problem which 
is linear in the poset size and factorial in k, the number of dummy arcs in arc 
representations of posets. Therefore it is a linear-time algorithm in the class of 
posets with bounded k, which may be considered as a degree of non N-freeness 
of posets. The algorithm works directly on an arc diagram of a poset and searches 
only a portion of the solution space which consists of semi-strongly greedy linear 
extensions. 

There exist in the literature two other algorithms for the jump number problem 
which are polynomial in certain restricted classes of posets. Colbourn and 
Pulleyblank [1] presented a dynamic programming algorithm which works on an 
arbitrary chain partition of a poset and finds the jump number in O(n m7) time, 
where n is the poset size and m is the number of chains in a partition. In 
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particular, one may apply this algorithm to a partition with the minimum number 
of chains which can be found efficiently by a bipartite matching algorithm. 

Another algorithm, presented by Habib and Mohring [2], makes use of the 
substitution decomposition of posets into prime (i.e. indecomposable) posets and 
solves the problem in time which depends polynomially on the poset size but is a 
highly non-polynomial function in /, the maximum size of a prime poset. 
Unfortunately, the approach of Habib and Mohring requires the knowledge of 
the jump number of all prime posets of size at most /. 

It is difficult to compare our algorithm with the two just mentioned above since 
there are no evident relations between the poset parameters which occur in the 
time complexity formulae of these algorithms. (We note only that a poset with 
bounded m can have an arbitrary k and conversally.) As an advantage of our 
approach over the two others we can list: very little preprocessing of the data, the 
existence of deterministic steps which depend on augmenting linear extensions 
with strongly greedy chains and, at last but not at least, its linear complexity in 
the poset size. 
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1. Introduction 


The study of large scale networks has been mainly motivated by practical 
problems, like transportation problems, reliability problems. The problems 
usually involve finding the optimal paths in the networks and they are rather 
similar in nature. These different networks can be unified into a more general 
form of network, the semiring network. Attempts to describe such networks in a 
general setting is not new. In [13], Shier has described an algebraic structure to 
study reliability problem. Carré {6] has given an excellent description of semiring 
networks and their properties using matrices. In this paper, we shall describe 
semiring networks and show how to deal with large matrices. The latter is 
particularly important because in large-scale networks, even computation on the 
computer presents some difficulties as the amount of random access memory in 
every computer is limited and the computation time may be long. 


2. Semirings 


Let S be a non-empty set with two binary operations, ® and ®. The system 
(S, ®, ®) is called a semiring if 

(1) (S, ®) is a commutative semigroup, 

(2) (S, ®) is a semigroup, 

(3) for any a, b,ceS, 


a®@(b@c)=(a®@b)@(a®c) 
(bBc)@a=(b Ba) P(c @a). 
For any pair of integers, n and m, let M,,,,,, be the class of all n x m matrices 
over the semiring (S, ®, ®). Define the operation © on M,,,., aS usual: if 
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A =[a,] and B =[b,] are matrices in M,,x,,, let 


i=1,...,n, 
A@®B=[a;,@Ob,|,_ . 
® [a,, ® i] j=l,...,m. 
Then (Mm, ®) is a commutative semigroup. 
Let n, m, k be three given positive integers. Let A =[a,] be a matnx in M, xm, 
B =[b,] be a matrix in M,,,.,. Define the matrix C = [c;] as follows: 


Ci = @ (ai, ® b,;). 


r= Li 


Let this operation be denoted by C=A®B. In the case where m=k=n, 
(M,,xn, ®) is a semigroup. Hence the following theorem holds. 


Theorem 2.1. Let (S,®,®) be a semiring. For a given positive integer n, 
(Mi xn, B, ®) is a semiring. 


Suppose n is a factor of two numbers, p and m, which are greater than 1. For 
each matrix, A =[a,], in M,,, define a collection of p x p matrices A; where 
i,j=1,...,m, as follows. 

Ay = [@G-1)p+h,j—-p+kl> h, k= 1, --+, Dp. 


Then the matrix A®B can also be obtained as specified in the following 
theorem. 


Theorem 2.2. Let A, B be matrices in M,,, wheren=p xm. Then C=A@QB 
iff 
Cj= @B (Ar ® B,;). 


r=1,.... 


This theorem is very useful for very large matrices because it gives us a way of 
decomposing the matrices. 


3. Semiring networks 


Let Q denote a given finite directed graph with vertex set V({2) and edge set 
E(Q). In this paper, we assume that V(92) = {1,...,}. Suppose S, and S, are 
two semirings whose additive semigroups have zero elements (i.e. additive 
identity elements) and whose multiplicative semigroups have identity elements. 
Let w be a mapping such that 


w:V(Q)—>S, 
w:E(Q)> 5. 


Large scale network analysis 349 


The function is called the weight function and the ordered pair (2, w) is called 
a semiring network. 

For a semiring network (92, w) over the semirings (S,, ®, ®) and (S,, ®, ®), 
let P(S2) denote the class of all paths of the directed graphs 22. Extend the weight 
function, w to paths of 2 as follows: if P= (vo, v,,..., Ux) is a path in P(Q), 
define 


w(P) = w(vo, V1) @ w(vy4, v2) @- - - @ w(vg_1, wy). 


Observe that w maps P into an element of S,. 

In some applications, the zero elements may not exist, but this can be easily 
remedied. Consider an element A that satisfies the following properties with the 
operations in (S,, 8, ®): 


ABA=A 
ABA=A 
a@®A=A@a=a forallainS,, 
aQ@A=ABa=A forallain S,. 


Then (S, U {A}, ®, ®) is also a semiring. Hence we may include A in the set S, 
as the zero element in the semiring. Now, we are ready to define matrices for 
representing networks. 

Let (Q,w) be a given semiring network. Suppose Q has 1 vertices, 
V(Q) = {1,2,...,}. Define its associated matrix A=[a,], i, j=1,...,n, as 
follows: 


fori=j, a, =identity of (S,, ®), 
w(i, j) if (i,j) € W(Q), 


fori #j, a;=|{ : 
rete 7 OLA otherwise. 


Given an assOciated matrix of a semiring network, define a sequence of n Xn 


matrices, Ay, A,,...,A,;,... as follows: 
Ag =A 
A, = Apo ® Ao 
Ares =A, QA, 
We say that the sequence Aj, A,,A2,... converges to A* if for some 


non-negative integer 1, 
A* =A,=Ajs) = Ayy2= eo 


In such a case, we call A* the induced matrix of (Q, w). 
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4. Basic probiems of semiring networks and their applications 


Consider some basic problems of the semiring network. For any given semiring 
network, its induced matrix does not always exist. So the problem is, under what 
condition does it exist? If it exists, what are the uses of the induced matrix? 
Finally, how can we compute A*? 


Existence of the induced matrix A* 


It is a well known fact that the induced matrix does not necessarily exist, as 
testified by the shortest path problem for the transportation network that contains 
negative cycles. For what types of semiring networks does the induced matrix 
exist? In [6], Carré has studied this problem. We shall give a brief outline of the 
ideas presented in his paper. 

In most of the applications (path problems, reliability problems, network flow 
problems), the additive operation ® is usually either the minimum or maximum 
operation. Hence, to generalize this idea into semiring networks, we may assume 
that a ®a=a and introduce an ordering (or partial ordering) to the semiring. 
This ordering is determined by the additive operation ®. 

For a semiring, define an order relation as follows: 


axbifa®b=a, 
and define the strict order relation as 
a<bifa<banda#b. 


Then the following theorem gives a condition for the existence of the induced 
matrix. 


Theorem 4.1. If a semiring network does not contain any cycle 0 with w(C)<e 
(where e is the multiplicative identity), then the induced matrix A* exists. 


Uses of the induced matrix 


Each (i, j) entry of the induced matrix A* gives the value of an ‘optimal’ path 
from vertex i to vertex j. The reason why we call it the ‘optimal’ path may not be 
very obvious until we look at the applications of the induced matrix. For example, 
in the shortest path problem in which the edge-lengths are non-negative real 
numbers, the induced matrix exists and is actually the distance matrix (matrix 
whose entries show the distance from one vertex to another) [1, 4, 5]. Similarly, 
for the reliability problem described in [13], the induced matrix is the reliability 
matrix. 

From the induced matrix, many other properties of the networks can be 
obtained mainly because many properties of the networks can be defined in terms 
of the entries of the induced matrix. Let us illustrate this by the transportation 
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problem with the semiring structure (RU {»},L,+) for its edges, where : 
denotes the minimum operation. 

The radius of a vertex, centres, centroids, total distance of a network are 
defined in terms of the distance of the shortest path between two vertices. the 
radius of a vertex k is the maximum of aj, + a;;, where i, j runs from 1 to n and it 
can be found as follows: 

(1) from the kth row of A*, aj,, @j2,...,@z, find the maximum, aj, of all 

elements in the kth row. 

(2) find the maximum, ai, of all the elements af,, a3,,...,a%*,, in the kth 

column of A*. 

(3) then the radius is a}, + a;,. 

The centre of a network is the vertex with the smallest radius and it can be found 
from the induced matrix A* as follows: 

(1) Let R=(n,...,7,) be the vector such that 


r= minimum{aj4, ..., a7} 
and let C = (c,,..., ¢,) be the vector such that 
c; = maximum {aft,, ... , ax}: 
(2) let S=R+C; then vertex i is the centre if 
$;=minimum{s,, ...,5,}. 


The total distance at a vertex i is defined as 


n 
da; + Day, 
j=l j=l 
and it can be obtained by adding the sum of all entries in the ith row and the sum 
of all entries in the ith column. The total distance of a network defined as 


a 
GjpevxVv 

and it is equal to twice the sum of all the entries of A*. To compare two different 

network designs, the total distance may be used as a form of measurement. 

Another measure that may be considered is the cost-effective ratio for two 

networks, which is defined as 


decrease in total distance 
increase in cost 


and this can be obtained by computing the total distances of the networks and 
their respective costs. 
Another application is to find a shortest path from a vertex i to another vertex 
j. Such a path can be obtained by the following algorithm: 
(1) If a =A, then there is no path from vertex i to vertex j; stop. Otherwise, 
go to Step (2). 
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(2) Take the ith row, R; and jth column, C; of the induced matrix A”*. 
(3) Let S=R,+C,. If there exists k#i,j such that s,=aj, then k is an 
internal vertex of a path from i to j; go to Step (4). Otherwise, (i, /) is such 
a path; stop. 
(4) Use this algorithm to find a shortest path from i to k and a shortest path 
from k to j. Then these two paths constitute a shortest path from i to j. 
An example in which the weight of vertices is defined is the inference network. 
Consider the network with vertex weight and edge weight both defined on 
{{0, 1], °,x), where I’ denotes the maximum operation. The inferenced value at 
a vertex v is 


max{w(u) X w(P)}, 


where u runs through every vertex in V(92), and p runs through every path from 
u to v. the inferenced value of all the vertices can be found from the induced 
matrix A* as follows: 

(1) Let W=(w,,...,w,), where w; = w(i). 

(2) Let W'= W®A’*, where ® is the matrix multiplication for matrices over 

the semiring ([0, 1], I, x). 

Then the entries of the vector W’ gives the inferenced value of each vertex in 
(Q, w). 

The above examples do not exhaust all the applications of the induced matrix. 
Other types of applications occur in network flow problems, assignment prob- 
lems, critical path analysis, reliability problems [2, 3, 13]. 


Various algorithms computing A* 


Many algorithms have evolved for computing the induced matrix. Two such 
algorithms are the cascade algorithm and Floyd’s algorithm [S—9]. These two 
algorithms operate on whole matrices. Variations of these two algorithms that 
decompose large matrices have also been developed [10, 11]. We shall briefly 
describe the cascade algorithm and Floyd's algorithm below. 


The Cascade Algorithm 


Fori=1ton 
forj=l1ton 


ay = @ (ax ® ay;) 


k=1,..., n 


Floyd’s Algorithm 


Fork=1ton 
for each pair i,je€ {1,...,n}, let cj;=ay, @a,; let A=C@A (where C 
denotes the matrix [c;;]) 
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An algorithm using decomposition of matrices is given blow. This makes use of 
Theorem 2.2. 


Algorithm using decomposition to compute A* 


Let A! = A and for each non-negative integer 1, 


A= @® (AU @A‘), Efe t,...,m}, 
k=1,...,m 


where the matrices Al), i, j € {1,...,m}, are submatrices of A[t] as defined in 
Section 2. 


5. Conclusion 


In this paper, we see that the concept of semiring networks can be used to unify 
various network problems. The idea of imposing algebraic structure is by no 
means new. Other types of structures have also been discussed [13]. These trends 
may lead to the study of general algebraic networks. 

For large networks, various methods have been devised to simplify and 
improve efficiency of algorithms for finding properties of networks using matrices. 
The availability of software that provides whole matrix operations allows us to 
consider algorithms which have been formerly deemed as inefficient. 
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This paper shows that both the nonseparating independent set problem and feedback set 
problem can be solved in polynomial time for graphs with no vertex degree exceeding 3 by 
reducing the problems to the matroid parity problem. 


1. Introduction 


Let G =(V, E) be a graph where V and E are the sets of vertices and edges of 
G, respectively. For Sc V, G—S is the graph obtained from G by deleting the 
vertices in S. 

A set of vertices / is said to be independent if no two vertices in J are joined by 
an edge. The independent set problem is to find a maximum independent set in 
G. It is well-known that this problem is NP-hard even for planar graphs with no 
vertex degree exceeding 3 [1]. 

A set of vertices X is called a separating set if the number of connected 
components of G — X is more than that of G. The nonseparating independent set 
problem is to find a maximum independent set which contains no separating set. 
Note that this is equivalent to the connected dominating set problem. This 
problem is also NP-hard even for planar graphs with no vertex degree exceeding 4 
[2]. 

A set of vertices F is called a feedback set of G if G — F contains no circuit. 
The feedback set problem which is to find a minimum feedback set is known to be 
NP-hard [3]. 

This paper shows that both of the nonseparating independent set problem and 
feedback set problem can be solved in polynomial time for graphs with no vertex 
degree exceeding 3. We will indicate that the nonseparating independent set 
problem and the feedback set problem of a 3-regular graph are closely related in 
the sense that they correspond to the matching problem and the spanning set 
problem of some linearly represented 2-polymatoid, respectively. 
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2. Nonseparating independent set problem 


2.1. Reduction to the problem for 3-regular graphs 


A set 1 c V is called a nonseparating independent set if / satisfies the following 
two conditions: 

(N1) J is an independent set 

(N2) / contains no separating set. 

We call / a maximum nonseparating independent set if / contains maximum 
number of vertices. v(G) is the number of vertices in a maximum nonseparating 
independent set of G. dg(v) is the degree of vertex v in G, that is, the number of 
edges incident to v in G. G(A) is the induced subgtaph on AcYV, that is, 
G(A)=G—(V —A). 

We assume that G is a multigraph with no vertex degree exceeding 3. We allow 
parallel edges and loops. Note that an independent set contains no endvertex of a 
loop by definition. 

Let G(X) be a connected component of G such that |X| <2. It is easy to see 
that v(G)=v(G-—X)+1 if at least one vertex of X has no loop, and 
v(G) = v(G — X) otherwise. Hence we may assume that each connected com- 
ponent of G has at least three vertices. 

Let v be a vertex of degree one, and H be the graph obtained from G by 
adding two vertices x and y, connecting them by two parallel edges, and adding 
edges (uv, x) and (uv, y). Note that d,,(v) = d,,(x) = d,,(y) = 3. By the assumption 
that each connected component of G has at least three vertices, the vertex 
adjacent to v is a cut-vertex of G and H, and hence we have v(G) = v(H). 

Let vu be a vertex of degree two, and u and w be the vertices adjacent to v. u 
and w may be identical. Let H be the graph obtained from G by deleting v, 
adding two vertices x and y, connecting them by two parallel edges, and adding 
edges (x,u) and (y,w). Note that d,,(x)=d,,(y)=3. It is easy to see that 
v(G) = v(H). 

In either case, we can easily construct a maximum nonseparating independent 
set of G from that of H. Since we can apply the above operations until there 
exists no vertex of degree less than 3, the nonseparating independent set problem 
for graphs with no vertex degree exceeding 3 is reduced to the problem for 
3-regular graphs. 


2.2. Reduction to the matching problem of 2-polymatroids 


We assume that G =(V, E£) is a 3-regular graph. u(G) is the nullity (cyclomatic 
number) of G. Define for each X c V 


F(X) = u(G) — u(G — X). 


It is merely a routine work to verify that function f satisfies the following 
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conditions: 

(P1) f@) =0 

(P2) f(X) <f(¥) if XcY 

(P3) f(XUY)+f(XN Y)<f(X) +f(Y) for any X, YoV 

(P4) f({x}) <2 for each x € V. 
Thus P(G)=(V,f) is a 2-polymatroid (this is called a (<2)-polymatroid by 
Lovasz [4]). Note that (P1)-(P3) hold for any graph while (P4) comes from the 
assumption that G is 3-regular (f({x})<d (x) —1 in general). Note also that 
f({x}) <2 if and only if x is a cut-vertex or the endvertex of a loop. Xc V is 
called a matching of P(G) if f(X) =2 |X]. 


Theorem 1. X cV is a matching of P(G) if and only if X is a nonseparating 
independent set of G. 


Proof. We first show that if X is a nonseparating independent set of G, X is a 
matching of P(G). We shall apply induction on |X|. The theorem is trivially true 
if |X|=1. For any v eX, X — {uv} is a nonseparating independent set of G. Since 
IX —{v}|<|X|, f(X—{v})=2|X—{v}| by the induction hypothesis. 
dg—(x-v)(v) = 3 and v is not a cut-vertex of G — (X — {v}) or the endvertex of a 
loop, for X is a nonseparating independent set of G. Hence f(X)=f(X- 
{v})+2=2|X|. 

Now, we shall show that if X is not a nonseparating independent set of G, 
f(X)<2 |X|. We may assume that X is a minimal one by (P3). If X consists of 
One vertex v, uv is a cut-vertex of G or the endvertex of a loop, and hence 
f(X) <2 |X| =2. If X consists of more than one vertex, X — {v} is a nonseparat- 
ing independent set of G for any veX. Since X is not a noOnseparating 
independent set of G, v is adjacent to some vertex in X — {v} or there exists a 
separating set A c X containing v. Hence, dg_(y-,)(v) <2 or v is a cut-vertex of 
G —(X — {v}), and so f(X) <f(X — {v}) #1<2|X|. O 


Lovasz presented a polynomial-time algorithm to obtain a maximum matching 
of a linearly represented 2-polymatroid (which will be defined later) [4]. Hence, 
by Theorem 1, we can solve the nonseparating independent set problem for G in 
polynomial time provided that we can obtain a linear representation of P(G) in 
polynomial time. 

We shall show that there exists a linear representation of P(G) for any G. 


2.3. Linear representation of P(G) 


Let M“(G) = (E, p*) be the dual matroid of the circuit matroid of G =(V, E), 
where 4 is the rank function of M4(G). 

Construct M*(G) = (E*, p*) from M“(G) and G as follows: 

(1) For each v € V, add two elements ef(v) and e3(v) to M4(G) so that ef and 
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e3(v) are parallel to arbitrary distinct elements of A,;(v), where A¢(v) is 
the set of edges incident to v in G. 

(2) Delete E from the resulting matroid. 

Since M4(G) is a linearly representable matroid, so is M*(G). We can obtain 
in polynomial time a linear representation of M*(G) from graph G represented 
by, for example, the incidence matrix. 

Given a matroid (E,r) and subsets A,, A>,...,A, CE, we can define a 
polymatroid (S, r’) on the set S = {A,, A2,..., A,} by 

r'(X)=r(U {A |A€X}) (X cS). 
If (E, r) is linearly represented, (S, r') is said to be linearly represented and we 
call (E, r) a linear representation of (S, r’). 


Theorem 2. M*(G) is a linear representation of P(G)=(V, f). 


Proof. Let p be the rank function of G. Define for any X CV, A(X)c E and 
X* c E* as follows: 


A(X) = {e |e € E, e is incident to some ve X c V} 
X* = {e*(v) |uEeX CV, i=1, 2}. 
By the construction of M*(G), 
p*(X*) = p4(A(X)) = |A(X)| + p(E — A(X)) — p(E) 
= |A(X)| + p(E — A(X)) + u(G) — |E| 
= u(G) — (JE| — |A(X)| — p(E — A(X))) 
= u(G)— (IE — A(X)| — p(E — A(X))) 
=u(G)— u(G— X) 
=f(X). 
Hence / (G) is a 2-polymatroid on the set {{e?(v), e3(v)}|veV}. O 


Thus, the nonseparating independent set problem can be solved in polynomial 
time for graphs with no vertex degree exceeding 3. 


3. Feedback set problem 


3.1. Reduction to the problem for 3-regular graphs 


A set Fc V is called a feedback set if G — F contains no circuit. We call Fa 
minimum feedback set if F contains a minimum number of vertices. 7(G) is the 
number of vertices in a minimum feedback set of G. 
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We assume as in Section 2 that G is a multigraph with no vertex degree 
exceeding 3. 

Let v be a vertex of degree one. It is trivial that n(G) = n(G — {v}). 

Let v be a vertex of degree two which is the endvertex of a loop. Then 
n(G) = n(G — {v}) +1. 

Hence we may assume that G has neither a vertex of degree one nor a vertex of 
degree two which is the endvertex of a loop. 

Let uv be a vertex of degree two, and u and w be the vertices adjacent to v. u 
and w may be identical. Let H be the graph obtained from G by deleting v and 
adding an edge (u, w). It is easy to see that n(G)=n(H), and we can easily 
construct a minimum feedback set of G from that of H. 

Since we can apply the above operations until there exists no vertex of degree 
less than 3, the feedback set problem for graphs with no vertex degree exceeding 
3 is reduced to the problem for 3-regular graphs. 


3.2. Relation to the nonseparating independent set problem 


A set TcS of a 2-polymatroid P=(S,r) is called a spanning set of P if 
r(T)=r(S). The following equality which is a generalization of Gallai’s identity 
for graphs is proved in [4]. 


|T| + |D| =r(S), 


where T and D are a minimum spanning set and a maximum matching of P, 
respectively. 

We assume that G is a 3-regular graph. From the definitions of the spanning 
set, the 2-polymatroid P(G), and the feedback set, we have the following 
theorem. 


Theorem 3. X < V is a spanning set of P(G) if and only if X is a feedback set 
of G. 


By Theorems 1 and 3, and the generalized Gallai’s identity, we have the 
following theorem, which was proved for connected cubic graphs by Specken- 
meyer [5]. 


Theorem 4. For a 3-regular graph G =(V, E), 
n(G) + v(G) = w(G). 


By this theorem and the fact that for any maximum nonseparating independent 
set / of G, each 2-connected component of G — / has at most One circuit, we can 
easily construct a minimum feedback set from /. Since we can obtain a maximum 
nonseparating independent set in polynomial time as shown in the previous 
section, we can solve the feedback set problem in polynomial time for graphs with 
no vertex degree exceeding 3. 
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In this paper, it is shown that a necessary and sufficient condition for the existence of a 
P,-factorization of K,,,, is (i) m+n =0 (mod 3), (ii) m <2n, (iii) n < 2m and (iv) 3mn/2(m + 
n) is an integer. 


1. Introduction 


Let P; be a path on 3 points and K,,,,, be a complete bipartite graph with partite 
sets V, and V2, where |V,|=m and |V,|=7. A spanning subgraph F of K,,,,, is 
called a P;-factor if each component of F is isomorphic to P. If K,,,,, is expressed 
as a line-disjoint sum of P;-factors, then this sum is called a P;-factorization of 
Kin a 

In this paper, a necessary and sufficient condition for the existence of a 
P,-factorization of K,,,,, will be given. 


2. P,-factor of Kin 
The following theorem is on the existence of P,-factors of K,,,,. 


Theorem 1. K,,,,, has a P3-factor if and only if (i) m +n =0 (mod 3), (ii) m<2n 
and (iii) n <2m. 


Proof. Suppose that K,,,,, has a P3-factor F. Let t be the number of components 
of F. Then r=4(m-—n). Hence, Condition (i) is necessary. Among these ¢ 
components, let x and y be the number of components whose endpoints are in V2 
and V,, respectively. Then, since F is a spanning subgraph of K,,,,, we have 
x+2y=m and 2x+y=n. Hence x=4(2n—m) and y=}(2m—n). From 
O0<x<m and 0<y <n, we must have m <2n and n<2m. Conditions (ii) and 
(iii) are, therefore, necessary. 

For those parameters m and n satisfying (i)-(iii), let x =4(2n —m) and 
y =4(2m — n). Then x and y are integers such that 0 <x <m and0<y <n. Hence 
x +2y=m and 2x +y=n. Using x points in V, and 2x points in V2, consider 
x P;’s whose endpoints are in V>. Using the remaining 2y points in V, and the 
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remaining y points in V,, consider y P;’s whose endpoints are in V,. Then these 
x+y P,’s are line-disjoint and they form a P-factor of K,,,.. O 


Corollary 1. K,,,, has a P,-factor if and only if n =0 (mod 3). 


3. P,-factorization of K,,, 
Our main theorem is On the existence of P;-factorizations of K,, py. 


Theorem 2. K,,,,, has a P3-factorization if and only if (i) m+n =0 (mod 3), (ii) 
mS 2n, (iii) n <2m and (iv) 3mn/2(m + n) is an integer. 


Proof. Suppose that K,,,, is factorized into r P,-factors. By Theorem 1, 
Conditions (i)—-(iii) are obviously necessary. Let ¢ be the number of components 
of each P,-factors. Then t=}(m +n) and r=3mn/2(m+n). Hence, condition 
(iv) is necessary. The proof of sufficiency will be given in Subsection 3.2. 


3.1. Extension theorem of P3-factorization of Kin.n 


We prove the following extension theorem, which we use later in the paper. 


Theorem 3. /f K,,,,, has a P3-factorization, then Ksn.5, has a P3-factorization for 
every positive integer s. 


Proof. Let V,, V2 be the independent sets of K,,,,,, where |V,| = sm and |V| = sn. 
Divide V, and V, into s subsets of m and n points each, respectively. Construct a 
new graph G with a point set consisting of the subsets which were just 
constructed. In this graph, two points are adjacent if and only if the subsets come 
from disjoint independent sets of K,,, 5, G is a complete bipartite graph K,.,. 
Noting that the cardinality of each subset identified with a point set of G is m orn 
and that K,, has a 1-factorization, we see that the desired result is obtained. 
1-factorizations of K,,, are discussed in [1,2]. O 


3.2. The proof of the sufficiency of Theorem 2 


There are three cases to consider. 


Case (1) m=2n: In this case, from Theorem 3, K2,,, has a P,-factorization since 
K, , is just P,. 


Case (2) n=2m: Obviously, K,,.2, has a P3-factorization. 


Case (3) m<2n and n<2m: In this case, let x =4(2n-—m), y= }(2m—n), 
t=4(m+n) and r=3mn/2(m+n). Then from Conditions (i)-(iv), x, y, t, r 
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are integers and O0<x<m and O0<y<n. We have x+2y=m and 2x+y=n. 
Hence r= (x + y) +xy/2(x +y). Let z =xy/2(x + y), which is a positive integer. 
And let (x, 2y)=d, x =dp, 2y=dq, where (p,q)=1. Then dp is even and 
z = dpq/2(2p + q). The following lemmas can be verified. 


Lemma 1. (p, q)=1> (pq, p+q)=1. 
Lemma 2. (p, g)=1> (pq, 2p + q) =1 (q: odd) or 2 (q: even). 


Using these p, q, d the parameters m and n satisfying Conditions (i)—(iv) are 
expressed as follows: 


Lemma 3. (p, q) = 1 and dpq/2(2p + q) is an integer 
(1) m=2(p+q)(2p + q)s, n= (4p + q)(2p + q)s when gq is odd, 
(II) m=2(p +2q')(p+q')s, n=2(2p+q')\(p+q')s when q=2q' and 
q' is odd, 
(III) m=(p+4q")(p +2q")s, n=2(p+q")(p+2q")s when q=4q", 
where s is a positive integer. 


> 


We use the following notations for sequences. 


Notation. Let A and B be two sequences of the same size such as 
A: @,,@2,...,@, 
B: b,, b2,..., by. 


If b;=a,+c (@=1,2,...,u), then we write B=A+c. If b; =((a;+c) modw) 
(i=1,2,...,u), then we write B=A+c modw, where the residuals a; +c 
mod w are integers in the set {1,2,..., w}. 


For the parameters m and n in (I)-(III) when s=1, we can construct a 
P;-factorization of K,, n. 


Lemma 4. (p, q)=1 and q is odd 
m=2(p + q)(2p + q), n= (4p + q)(2p +4) 
> Kan has a P3-factorization. 


Proof. The proof is by construction (Algorithm I). Let x=4(2n-—m), y= 
4(2m —n), t=43(m +n), r=3mn/2(m +n). Then we have x =2p(2p +q), y= 
q(2p+q), t=(2p+q), r=(p+q)(4ptq). Let n=ptq, n=4p+q, m= 
m/r,=2(2p + q), No=n/r,=2p+q. Consider two sequences R and C of the 
same size 4(2p + q). 


R: 1,1, 2,2,..., 2p +q), 22p +4) 
C: 1,2,...,4(2p+q)—1, 4(2p +4q). 
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Construct p sequences R; such that R;=R+2(i-1)2p+q) (@=1,2,...,p). 
Construct p sequences C; such that C,=(C + 2(i— 1) mod 4(2p + q)) +40 - 


1)(2p +q) ((=1,2,..., p). Consider two sequences R’ and C’ of the same size 
2(2p + q). 

R': 1,2,...,2(2p +q)—1, 2(2p +q) 

CYL 3.0, 2p +q, 2) 4,25 2p +q— J, 1s 3)0.<, 2p g, 2,4,.25, 2p + 
q-1. 


Construct q sequences R; such that R;=R'+2(i-1)2p+q)+2p(2p+q) 
(i=1,2,...,q). Construct q sequences C; such that C;=(C’+(i-1)+ 
2p mod 2p + q)+(i-1)2Qp+q)+4p(2p+q) (i=1,2,...,q). Consider two 
sequences / and J of the same size. 


I: Ry, Ro, ..., Ry, Ri, Ri, -.., Ri 


Te Cir siege CWE in ah, 


Then the size of J or J is 2t. Let i, and j, be the kth element of J and J, 
respectively (k =1, 2,..., 2f). Join two points i, in V, and j, in V, with a line 
(ix, Je) (K = 1, 2,..., 2t). Construct a graph F with two point sets {i,} and {j,} 
and a line set {(i,, j,)}. Then F is a P3-factor of K,,,,. This graph is called a 
P;-factor constructed with two sequences | and J. 


Construct r, sequences J; such that J;,=/+(i-—1)m)modm (i=1,2,...,7). 
Construct 7, sequences J, such that J;=J+(j—1)nomodn (j=1,2,..., 1). 
Construct 7,7 P,-factors F, with J, and J, (@=1, 2,...,m;7=1,2,...,7). Then 


it is easy to show that F; are line-disjoint and that their sum is a P;-factorization 
of K,,.,, 0 


Lemma 5. (p, q) = 1 and q = 2q' (q': odd) 
m=2(p +2q’)(p+q'),n=2(2p +q'\(p+q') 
> K,,., has a P3-factorization. 


Proof. The proof is by construction (Algorithm II). Let x =4(2n-—m), y= 
4(2m —n), t=4(m+n), r=3mn/2(m +n). Then we have x =2p(p+q'), y= 
2q'(p+q'), t=2%(p+q'y, r=(p+2q')(2p+q'). Let »=p+2q', n=2p tq’, 
my= m/r, = 2(p +q'), no = n/n =2(p +q'). Consider two sequences R and C of 
the same size 4(p + q’). 


R: 1,1,2,2,...,2(p+q'),2(p+q’) 
C: 1,2,...,4p+q')-1,4(p+q’). 


Construct p sequences R, such that R,=R+2(i-1)(p+q’) @ =1,2,..., p). 
Construct p sequences C, such that C,;=(C +2(i— 1) mod 4(p +q’)) +4 - 
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1)(p +q') @=1, 2,..., p). Consider two sequences R’ and C’ of the same size 
4(p +q’'). 
R': 1,2,...,4(p+q')—-1,4(p+q') 
C': 1,3,...,2(p+q')—1,1,3,...,2(p+q')—1,2,4,...,2(p+q’), 
2,4,...,2(p+q’). 
Construct q' sequences R; such that R;=R'+4(¢-—1)(p+q')+2p(p+q’') 
(i=1,2,...,q'). Construct qg' sequences C; such that C;=(C'+2(i-1)+ 


2p mod 2(p + q’)) +20 —1)(p + 4’) + 4p(p +4’) @=1,2,...,4'). 
Consider two sequences J and J of the same size 21. 


PRR ORG Re, Ree RO 
FOC ce i CONC OC. 


Construct r; sequences J; such that ,=/+(i-—1)mypmodm (i =1,2,...,7). 
Construct r, sequences J; such that J;.=J+(j—1)nomodn (j=1,2,...,n). 
Construct rr, P;-factors F; with J; and J; (i=1,2,...,74;7=1,2,...,7). Then 


it is easy to show that F; are line-disjoint and that their sum is a P;-factorization 
of K,,,. O 


Lemma 6. (p, q) = 1 and q = 4q" 


m = (p + 4q")(p +24"), n =2(p + q")(p +2q") 
> K,,,, has a P3-factorization. 


Proof. The proof is by construction (Algorithm III). Let x= 4(2n-—m), y= 
1(2m—n), t=\(m+n), r=3mn/2(m+n). Then we have x=p(p+2q’), 
y =2q"(p +29"), t= (pt+2q"), r=(p+-+q")\(p+q"). Let =p +4q", n=p+ 
q", my =m/r, = p + 2q", ny = n/n, = 2(p + 2q"). Consider two sequences R and C 
of the same size 2(p + 2q"). 


R: 1,1,2,2,...,p+2q",p+2q" 
C: 1,2,...,2(p +2q")— 1, 2(p + 2q”). 
Construct p sequences R; such that R;=R+(i-1)(p+2q") ((=1,2,..., p). 


Construct p sequences C,; such that C;=(C + 2(i — 1) mod 2(p + 2q")) + 2G - 
1)(p + 2p”) i =1, 2,..., p). Consider two sequences R’ and C’ of the same size 


4(p +29"). 
R': 1,2,...,4(p +2q")- 1, 4(p +29") 
Ch: 1,3,..., 2p +29")— 1,2, 4,..., 2p +2q”), 3, 5,---, 2p +29") — 1, 1, 
4,6,...,2(p +2q"), 2. 


Construct q” sequences R; such that R; =R'+4(i-—1)(p + 2q") + p(p + 2q") 
(i=1,2,...,q"). Construct q” sequences C; such that C;=(C’+4(@-1)+ 
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2p mod 2(p + 2q”)) + 2(i — 1)(p + 2g") + 2p(p + 2q") (6 =1, 2, ..., qg"). Consider 
two sequences / and J of the same size 2¢. 


Ee Rig Rossi Re RU Ree 

PO Conn Cos Cie Cy ea Cae 
Construct 7, sequences /; such that J, =/+(i-—1)m)>modm (i=1,2,...,n). 
Construct r, sequences J; such that J;=J+(j—1)nomodn (j=1,2,..., 7). 
Construct r,~, P;-factors F, with J; and J; (@=1,2,...,73j=1,2,...,7™). Then 


it is easy to show that F;, are line-disjoint and that their sum is a P,-factorization 
of K,,,. O 


Applying Theorem 3 with Lemmas 4 to 6, it can be seen that for the 
parameters m and n satisfying Conditions (i)-(iv), K,,,, has a P,-factorization. 


This completes the proof of Theorem 2. 


Corollary 2. K,,,, has a P;-factorization if and only if n=0 (mod 12). 
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During the past few years papers have appeared that take a graph theoretic approach to the 
investigation of PL-manifolds. These graphs have variously been called crystallizations, gems 
(graph encoded manifolds) and n-graphs. The basic ideas and major results of this 
combinatorial method are surveyed in this paper. 


1. Introduction 


The use of combinatorial methods in topology is certainly not new; simplicial 
homology is an obvious example. However the graph theoretic approach to be 
surveyed here is fairly recent, due to Pezzana, Ferri, Gagliardi, Lins, Mandel, 
Vince and others. This paper is not intended as a complete review of the subject; 
for this we refer to [7] where an extended list of references can also be found. 
Instead we present only major results with motivation. Proofs are either sketched 
or omitted. A reference is given for each result but the proof sketched may not be 
the original. 

Graphs can have multiple edges and V(G) and E(G) denote the point and 
edge sets of G respectively. Let [n] denote the set {0, 1,...,2—1}. Ann-graph 
is a graph G, regular of degree n, together with an edge coloring E(G)— [n] such 
that incident edges are different colors. The motivation for this definition is that 
an (n + 1)-graph G encodes an n-dimensional simplicial complex AG as follows. 
For each point v of G let o, be an n-simplex whose set of n +1 vertices is in 
bijection with [n + 1}. Let k be the disjoint union of the o,, véV(G). For each 
ie[n+ 1] identify the (n — 1)-face of 0, colored [nm + 1] — {i} with the (n —1)- 
face of o, colored [n + 1] — {i} if and only if u and v are joined in G by an edge 
colored i. If ~ is this indentification then K/~ is denoted AG and the underlying 
topological space of AG is denoted |G|. Fig. 1 is an example of a 3-graph G and 
the corresponding 2-dimensional complex AG. The underlying space |G| is the 
2-sphere. 

Throughout this paper manifolds will be PL, compact, and without boundary. 
The simple, but basic, fact in the theory is that any PL n-manifold M is 
homeomorphic to |G| for some (n + 1)-graph G [2, 19, 21]. Such a G is obtained 
as the appropriately colored dual 1-skeleton of the barycentric subdivision of any 
triangulation of M. Recall that the dual 1-skeleton of a triangulation is the graph 
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2 


S AG 


Fig. 1. A 3-graph and the associated simplicial complex AG. 


whose points are the facets and two points are joined by an edge if and only if the 
corresponding n-simplices share a codimension 1 simplex. Hereafter > denotes 
homeomorphism. 


Theorem 1. For any PL n-manifold M there exists an (n + 1)-graph G such that 
|G|=M. 


It is known that every 1, 2, or 3-manifold can be triangulated and hence can be 
encoded as a 2, 3, or 4-graph. Therefore in these low dimensions the scheme is 
completely general. In Fig. 2, for example, are graphs representing the sphere 
product S'x S? and the non-orientable sphere bundle S' x S’, which will be 
useful later. Our point of view from here on is to regard an n-manifold as an 


s'x S$? S'xS? 


Fig. 2. Encodings of the orientable and non-orientable 2-sphere bundles over S'. 
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(n + 1)-graph. The goal is to gain topological insight into the space |G| from the 
combinatorics of the graph G. 

The basic notions about n-graphs addressed in this paper are (1) fusion, (2) 
canonical forms, (3) fundamental group and (4) regular embedding. Basic results 
connecting graphs and manifolds are contained in Section 2. In Section 3 
equivalence of n-graphs is defined. Equivalence of graphs G, and G, corresponds 
to homeomorphism between the topological spaces |G,| and |G|. An equivalence 
step is one of several types of fusion, a basic operation on n-graphs that is also 
discussed in Section 3. The ideal situation would be to have, for each PL 
manifold, a unique canonical n-graph. Unfortunately this is known only for n <3 
and is covered in Section 4. Two simple algorithms on an n-graph G are given in 
Section 5 for determining the fundamental group of |G|. Regular embedding of 
an n-graph on a closed surface provides a new topological invariant for 
manifolds—the graph theoretic genus. For 2-manifolds this invariant is the 
ordinary genus of the surface, and for 3-manifolds it is essentially the Heegaard 
genus. This and other properties of embedding are discussed in Section 6. 


2. Graph encoded manifolds 


Although proofs concerning n-graphs are often combinatorial, the next result 
provides the link between the combinatorics of the graph G and the topology of 
the complex AG. For an n-graph G and for J c[n] let G, denote the subgraph of 
G obtained by deleting all edges with colors not in J. Each connected component 
of G, is a |J|-graph and is called a residue of type J or |J|-residue. In particular, G 
itself is the only n-residue; each point of G is a O-residue; each edge is a 
1-residue; 2-colored cycles in G are 2-residues, etc. 


Theorem 2. There is a 1-1, inclusion reversing correspondence between the 
residues of an (n + 1)-graph G and the simplices of AG. For i €[n], i-residues in 
G correspond to (n —i)-simplices in AG. 


By the above theorem, facets (highest dimensional simplices) of AG 
correspond to points of G; codimension 1-faces of AG corresponds to edges of 
G;...3 vertices of AG correspond to n-residues of G. Hence links of vertices in 
AG are encoded by n-residues; in general, links of i-faces in AG are encoded by 
(n — i)-residues of G. 

A 2-graph encodes the circle $'; a 3-graph encodes a surface. For an n-graph, 
n = 3, the underlying space |G| is not necessarily a manifold because the link of a 
vertex in AG is not necessarily a sphere. As a consequence of Theorem 2 a 
4-graph encodes a 3-manifold exactly if each 3-residue encodes a 2-sphere. 
Moreover, application of the Euler characteristic to 3-residues results in the 
following necessary and sufficient condition for a 4-graph to encode a manifold 
[2, 19]. 
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Theorem 3. Let v denote the number of vertices in a 4-graph G and mr, rs the 
number of 2 and 3-residues, resp. Then v = r2 — 1, with equality if and only if |G| is 
a manifold. 


A space |G| is orientable if there is a coherent orientation of the facets in AG. 
In terms of the graph G this is expressed as follows [21]. 


Theorem 4. The topological space |G| is orientable if and only if the graph G is 
bipartite. 


3. Equivalent n-graphs 


A basic construction in the theory of n-graphs is fusion. Consider two points u 
and v in an n-graph G(or in two n-graphs G, and G,) and let G* be the n-graph 
obtained by 

(1) removing u, v and all edges connecting them; 

(2) reconnecting the ‘free’ edges (previously incident to one of u or v) of 

like color. 
Then G* is said to be obtained from G by fusion on u and v. If there are m>1 
edges connecting u and v then the graph removed in step (1) is called an m-dipole 
and the fusion is called removing a dipole. The inverse operation is called adding 
a dipole. If J denotes the set of colors of a dipole D of an n-graph G and if u and 
v lie in the same residue of type [n] —J, then D is called degenerate. Otherwise D 
is non-degenerate. 

Call two n-graphs equivalent if one can be obtained from the other by a 
sequence of adding or removing non-degenerate dipoles. Fig. 3 shows three 
equivalent 3-graphs. First dipole d, is added and then dipole d, is removed. 
Removing a non-degenerate dipole in G corresponds in |G| to removing a ball 
and identifying two hemispheres on the boundary in the natural way. This makes 
the ‘if part of the following theorem reasonable. What is surprising is that the 
converse is also true [4]. 


Theorem 5. |G,| = |G| if and only if G, and G, are equivalent. 


The graph of Fig. 3c encodes the Klein bottle (see Section 4). Hence, by 
Theorem 5, Fig. 3a also encodes the Klein bottle. 

The topological consequences of other types of fusion are summarized in the 
following theorem. If points u and v lie in distinct n-graphs G, and G, then fusion 
is denoted by G, # G,. Recall that the connected sum M, # M, of two manifolds 
is obtained by removing an open ball from each and identifying the two spherical 
boundaries via a homeomorphism. Also ® will stand for the orientable (ordinary 
product) or non-orientable bundle. Parts (a, b, c, d) of the theorem are found in 
(4, 17, 12, 8], respectively. 
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Fig. 3. Equivalent 3-graphs. 


Theorem 6. For 4-graphs that encode 3-manifolds 
(a) |G, # Go| ~|G\| # |G. 
(b) If u and v are not contained in the same 3-residue of G and G' is obtained 
from G by fusion on u and v, then |G'|~|G| # (S'@ S’). 
(c) If G' is obtained from G by removing a degenerate 2-dipole then 
|G|=|G'| # (S'@S?). 
(d) If G’ is obtained from G by removing a degenerate 1-dipole then one of 
following cases holds: 
(i) |G|=|G"| 
(ii) |G] ={G'| # (S' @S”). 
(iii) |G| ~|G’| # (S' ® S’) # (S' @ S?) 
(iv) |G| =|Gi| # |G3| # (S'@S’) 
depending on whether the endpoints u and v of the dipole are both contained 
on exactly (i) one 2-residue with colors other than that of edge {u, v}; (ii) 
two such 2-residues; (iii) three such 2-residues and G' is connected; or (iv) 
three such 2-residues and G' has two connected components G, and G3. 


Examples of parts (a) and (c) are shown in Figs 4 and 5 respectively. In Fig. 4, 
G, and G, both encode S' x S? and G, # G, encodes the connected sum. 

Some comments on Theorem 6 are in order. First, the graph G’ always encodes 
a 3-manifold. In part (a) |G, # G,| is independent of u and v unless both G, and 
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Fig. 4. Connected sum. 


G, are bipartite. In this case both |G,| and |G,| are orientable (by Theorem 4) and 
there are two possible topological connected sums (orientable and non- 
orientable); which one depends on the partite sets in which u and v belong. In 
part (b) @ is the orientable or non-orientable sphere bundle depending on 
whether or not there is a path from u to v with an odd number of edges. Likewise 
in (c) and (d) the graph G determines the type of bundle in a straightforward 
way, but we omit the details here and refer the reader to the original sources for 
these results. The only case of fusion not listed in Theorem 6 is that of a 


G:~S xs? iG ~ 3 


Fig. 5. Removing a handle. 
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degenerate 1-dipole where u and v are contained in no 2-residue (of type not 
containing the color of {u, v}). Unlike the cases covered in Theorem 6, there is 
no known characterization of the topological space corresponding to the 4-graph 
after the removal of such a dipole. Unfortunately, in a large class of 4-graphs, all 
dipoles are of this type [8]. 


Proof sketch of Theorem 6. In part (a) the operation # corresponds, via 
Theorem 2, to removing a facet from AG, and AG, and identifying boundaries, 
i.e. a connected sum. Likewise in part (b) the fusion corresponds to removing two 
disjoint facets from AG and identifying boundaries. This is usually referred to as 
‘adding a handle’, which is equivalent to a connected sum as in part (b). For part 
(c) a 4-graph G* exists (see [12] for the construction) such that (1) G* is 
equivalent to G' (by removing two nondegenerate dipoles of G*) and (2) G is 
obtained from G* by fusion of two points of G* not in the same 3-residue. The 
result then follows from part (b) and Theorem 5. Part (d) is proved in a similar 
fashion using part (c). O 


4. Canonical n-graphs 


One of the major goals in the theory of n-graphs is to obtain canonical forms 
from PL-manifolds. Various notions of ‘canonical’ appear in the literature. If M is 
a PL-manifold, then G is called minimum for M if G is an n-graph with minimum 
number of points that encodes M [19]. For example, the minimum 4-graph for the 
3-sphere is given in Fig. 6a (two 3-balls with their boundaries identified). The 
minimum number of (m — 1)-residues in an n-graph is n. An n-graph that achieves 
this minimum is called simple [19]. A simple n-graph that encodes a manifold is 
also referred to in the literature as a crystallization [2]. We call an n-graph with 
no non-degenerate dipoles reduced. It is easy to check that if G is reduced, then 
G is simple. The converse is not true, as in Fig. 6b. Also it is obvious that if G is 
minimum, then G is reduced. Again the converse is not true, as in Fig. 6c [15]. 
By removing nondegenerate dipoles until no longer possible, Theorems 1 and 5 


imply: 
Corollary 7. For any manifold M there is a reduced n-graph G such that |G|= M. 
A strong form of Theorem 5 for simple n-graphs is proved in [4]. 


Theorem 8. Let G, and G, be n-crystallizations. Then |G,| = |G,| if and only if G, 
can be obtained from G, by a finite sequence of the following moves: 
(a) adding or removing a non-degenerate m-dipole, n-1>m>1. 
(b) addition of a non-degenerate 1-dipole followed by the removal of another 
non-degenerate 1-dipole. 
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Fig. 6. Some 4-graphs encoding S*. 


Note that in each step in the process of Theorem 8, the intermediate n-graph 
is also a crystallization. Also for any i € [n], the moves (a) and (b) can be taken 
so that either all the dipoles contain color i or none contain color : [3]. 

The ideal situation would be to have a unique reduced graph to represent each 
equivalence class of n-graph. Unfortunately the only non-trivial case for which 
this is known is n = 3. To classify the 3-graphs up to equivalence let C,,, be the 
cycle with 2m points {0,1,..., 2% —1} where edge {2i — 1, 2i} is colored 0 and 
{2i, 2i +1} is colored 1 for all i (mod 2m). Let H,, be the 3-graph obtained from 
Can, m odd, m21, by adding edges {i,i +m} colored 2 for all i (mod 2m). 
Likewise let L,, be obtained from C,,,, m 22, by adding edges {i, 2m —i} and 
{0, m} colored 2 for all i (mod 2m). The graphs H, and L, are shown in Fig. 7. 


Fig. 7. Graphs encoding surfaces. 
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Theorem 9. Each 3-graph is equivalent to a unique graph H,,, m2=1, m odd or 
Lin) M22. 


Corollary 10. Two reduced 3-graphs are equivalent if and only if they have the 
same number of points and are both bipartite (or not bipartite). 


Theorem 9 is just a graph theoretic version of the well known classification of 
closed surfaces. Using Theorem 2 (to compute the Euler characteristic) and 
Theorem 4, it is easy to check that |H3_,| and |L3_,| are the orientable and 
nonorientable surfaces, resp., with Euler characteristic v. So |H,| is homeomor- 
phic to the 2-sphere, |H3| the torus, |L,| the projective plane, |L3| the Klein 
bottle, etc. Concerning Corollary 10, the number of points and the property of 
being bipartite are invariants of reduced 3-graphs because all steps in Theorem 8 
must be of type (a). Each step of type (a) leaves the number of points of G and 
the properties of being reduced and bipartite invariant. In the other direction the 
corollary follows from Theorem 9. If G and G' are reduced, have an equal 
number of points and are both bipartite (or not bipartite), then this is also true of 
B and B’, the graphs to which, by Theorem 9, they are equivalent. Then 
necessarily B = B’. 

Any 3-manifold can be encoded by a 4-graph, so naturally the classification 
problem for 4-graphs becomes drastically more complex than for 3-graphs. For 
example, by Theorem 5 the 4-graphs in Fig. 6c and 6a must be equivalent, but 
there is no known algorithm, in general, for obtaining the correct sequence of 
dipole additions and removals. Certain n-graphs with additional structure, called 
normal crystallizations, have been shown to encode any 3-manifold [1, 14], but 
these also have not led to a classification. (So the somewhat technical definition is 
omitted here.) The determination of a set of moves to get, algorithmically, from 
any n-graph to any equivalent one would, of course, be a remarkable discovery. 
For more on this problem see [24]. 


5. The fundamental group 


The encoding of a manifold M by an n-graph provides easy algorithms for 
finding a presentation of the fundamental group of M. We give two such 
algorithms. Here (X | R) denotes a presentation of a group with generators X 
and relations R. Let H be a subgraph of an n-graph G and e an edge in 
E(G)—E(H). Call e dependent on H if there is a k-colored cycle, k<n, 
containing e, all of whose other edges lie in H. Let H = Ho, M,,..., H, = H* be 
a sequence of subgraphs of G such that 

(1) H,,,= H, +e where e is dependent on H; and 

(2) there is no edge in G dependent on H*. 

Then H® is called the closure of H in G. 
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In both algorithms the input is a manifold encoded by an n-graph and the 
output is a presentation for 2,(|G|). 


Algorithm 1 [24]. 

(0) Remove non-degenerate dipoles until G is reduced. 

(1) Construct a spanning tree T in G. 

(2) Determine the closure T* of T in G. 

(3) Arbitrarily assign an orientation to each edge in X = E(G) — E(T*). 

(4) For each 2-residue t let 7, be the sequence of edges in X around the cycle 
t, each with exponent +1 or ~—1 depending on the orientation. Let 
R =({r, | tis a 2-residue}. 

(5) Then (X | R) is a presentation for z,(|Gl). 


Algorithm 2 [18]. 

(0) Remove non-degenerate dipoles until G is reduced. 

(1) Let X= V(G), the point set of G. 

(2) Choose two colors j,k €[n] and let C denote the set of all 2-residues 
(cycles) of type {j, k}. 

(3) For any cycle o € C of length 2m let r, denote the word x,x7! + ++ X2n—i1X3h 
where x;€X is the i-point of o. The initial point x, and the direction 
around o is arbitrary, except that the edge connecting x, and x, should be 
colored j. Let R, = {r, | 0 € C}. 

(4) Let R.={xy~'|x,yeX and x, y belong to the same residue of type 
[n] — (i, k}}. 

(5) If xo is an arbitrary point of G then (X | R, UR2U {xo}) is a presentation 
of ,(|G)). 


As an example, consider the 3-graph in Fig. 8, which encodes the Klein bottle 
K’. (Those edges not colored 0 or 1 are understood to be colored 2.) Using 
Algorithm 1 (Fig. 8a), the spanning tree T consists of all edges except one in the 
unique residue of type {0,1} and T* is the entire (darkened) cycle. Then the 


Fig. 8. Computing the fundamental group of |G]. 
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presentation of ,(K”) is (x, y, z|xyz~'=xzy =1) which, by removing gener- 
ator z, is equivalent to (x, y | x’y*?=1). 
Choosing the pair of colors {0, 1} for Algorithm 2 (Fig. 8b) the presentation is 


(x, y,z,x',y',2'|x=x',y=y',z=z',xy ‘zy 'x’z'=1,z=1) 
which simplifies after removing generators x’, y’, z', z to 


(x, y [xy ty x =1) = (x, y |x’y?=1). 


Proof sketch of Algorithms 1 and 2. The first algorithm is based on the fact that G 
is the dual 1-skeleton of AG. Then up to homotopy, each based loop of 7;(|G|) is 
represented by a based closed edge path in G. Each edge ee E(G)-— E(T) 
represents the path in G that is the unique cycle in T+e. An edge in T* 
represents a null homotopic loop in |G|; hence X is the generating set for 2,(|G|). 
Each two colored cycle in G also corresponds to a null homotopic loop in |G|, 
hence the relations R. 

The idea of Algorithm 2 is dual to that of Algorithm 1, in that loops in |G| are 
represented by based edge paths in AG rather than in G. The fundamental group 
x,(|G|) is isomorphic to the standard edge path group E(AG) whose elements 
are closed edge paths in the 1-skeleton of AG, and edge paths around 2-simplices 
are null homotopic. It is the main theorem of [13] that a presentation (X | R) for 
E(AG) can be obtained by taking X as the set of all edges of type {j, Kk} except 
one (recall that the vertices of AG are colored), and the relations are read around 
the links of simplices of type [n]— {j,k}. Translating this presentation, via 
Theorem 2, from AG to the graph G results in Algorithm 2. O 


6. Regular embedding 


Let F be a closed surface. An embedding of an n-graph G on F is called regular 
(strongly regular in [10]) if 

(a) The components of F — G are 2-cells. 

(b) For any adjacent pair of points u and v of G, the cyclic permutation t,, of 
the color set [n] induced by the edges about u is the inverse of the cyclic 
permutation 1, induced by the edges about v. 

Up to inverse the cyclic permutation of the edge colors on F is the same at each 
point. So by counting the number of such cyclic permutations, it is easy to see 
that there is at most one regular embedding of a 3-graph and at most three 
regular embeddings of a 4-graph. Note that each 2-cell of F must be bounded by a 
2-residue. Conversely, by spanning each of these (disjoint) 2-residues by a 2-cell 
and identifying pairs of edges that are the same in G, each of the regular 
embeddings above can be constructed. In general this argument shows that there 
are exactly 4(m — 1)! regular embeddings of an n-graph. 

In this section a graph theoretic invariant of a PL manifold is defined that 
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simultaneously generalizes the ordinary genus of a surface and the Heegaard 
genus of a 3-manifold. For an n-graph G let the genus p(G) be the smallest 
integer g such that G regularly embeds in a surface of genus g. (For an orientable 
surface M the genus g(M) is the number of torus factors in a connected sum 
(handles), while for a non-orientable surface it is the number of projective plane 
factors (cross caps). Then for a PL manifold M let 


p(M) = min{p(G) | |G| = M} 


be the minimum genus of any encoding of M. Using Theorem 4 it is not difficult 
to show that the surface F on which G is embedded is orientable if and only if |G| 
is orientable. Hence p(M) is the orientable or non-orientable genus depending on 
whether or not M is orientable. 

The definition above and the results that follow appear in [11]. Recall that a 
handlebody of genus g is a 3-manifold (with boundary) obtained by identifying in 
pairs 2g disjoint 2-cells on the boundary of a 3-ball [16]. A Heegaard splitting of 
genus g of a 3-manifold M is a pair H, H’' of handlebodies such that 
HO H'=0H = oH' and HU H’'=M. The common boundary 0H = OH’ is called 
the Heegaard surface, which is a surface of genus g if orientable or 2g if not. The 
Heegaard genus h(M) is the smallest integer g such that M admits a Heegaard 
splitting of genus g. 


Theorem 11. (a) /f M is a 2-manifold then p(M)=g(M). 
(b) If M is a 3-manifold then 


(M) = ae if M is orientable 

e ~ 12h(M)_ if M is non-orientable. 

Proof sketch of Theorem 11. Part (a) of Theorem 11 is easy to prove because G, 
as the dual graph of the triangulation AG, is regularly embedded on |G|. Part (b) 
is proved in two parts. Given a 4-graph G regularly embedded on a surface F, 
assume, without loss of generality, that for each cyclic permutation 1, we have 
t2(0)=1. Then consider the surface S$ that is the union of the 4-sided cells 
embedded in the facets of AG (see Fig. 9) such that there is a vertex of a 4-sided 


f=) 


2 


aw 
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Fig. 9. Portion of the Heegaard surface. 
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cells at the midpoint of each edge of a facet except the edges 01 and 23. Now S§ 
splits {G| into two handlebodies that are regular neighborhoods of the subgraphs 
of G induced by the edges colored 01 and 23, resp. Hence S is a Heegaard 
surface. The graph G is embedded in S as the dual graph of this cell division into 
squares such that the cyclic permutations about the vertices of this embedding are 
the same as for the embedding of G in F. Therefore S is homeomorphic to F. 

Conversely the Heegaard splitting of genus g determines two sets of pairwise 
disjoint curves on the Heegaard surface F. Each such set consists of the 
boundaries of a set of pairwise disjoint 2-cells that cut the handlebody H or H’ 
into a 3-ball. These systems of curves, called a Heegaard diagram, can be altered 
slightly to obtain the desired 4-graph embedding on the surface F. O 


In [6] estimates are made on the genus of some 4-manifolds; here 7,, U, denote 
the orientable surface of genus g and the non-orientable surface of genus h, 
respectively: 

p(S' x T,) =2g +1 

p(S'x U,)=2h +2 
2(g +8')< p(T, X T,) < 2(6gg' + 3(g +8’) +2) 
2(2g +h) < p(T, x U,) < 2(6gh + 3(2g + h) + 4) 
2(h +h’) < p(U, X Un) S$ 2(3(AA' +h +h') +4) 

A corollary of Theorem 11 is that a 4-graph of genus 0 encodes a manifold |G| 
that has Heegaard genus 0, and hence is a sphere. This is generalized in [5]. 


Theorem 12. For an (n + 1)-graph G, p(G) =0 if and only if |G| = 8S”. 


The proof of Theorem 12 is mainly topological. A stronger statement than this 
theorem is the following. If true, it should have a completely combinatorial proof. 
Here G2 is the n-graph with exactly two points and n edges joining them. It is the 
minimum n-graph for S”~', (Gj is shown in Fig. 6a). 


Conjecture. If G is a reduced n-graph and p(G) =0 then G = G*.. 


7. Conclusion 


The subject of n-graphs is new. Three-manifold theory is well developed, 
whereas the theory of 4-graphs is not. The hope is that the graph theoretic 
method holds some potential for combinatorial insight into topological ques- 
tions. As the subject of n-graphs has developed the proofs have become less 
topological and more combinatorial. When it is not necessary to refer to the 
complex AG, via Theorem 2, the proofs (in the biased estimation of the author) 
take on a clear graph theoretic character. 
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The point-linear arboricity of a graph G=(V,E), written as po(G), is defined as 
Po(G) = min{k | there exists a partition of V into k subsets, V = Ut, V;, such that (V,) is a 
linear forest for 1<i<k}. In this paper, we will discuss the point-linear arboricity of planar 
graphs and obtained following results: 

p.(G) = 2 if G is a outplanar graph. 
Po(G) = 4 if G is a planar graph. 


1. Introduction 


Chartrand [1] studied the point-arboricity of planar graphs and showed that the 
point-arboricity of a planar graph does not exceed 3. 

Now we will discuss the point-linear arboricity. The point-linear arboricity of a 
graph G =(V, E), written by po(G), is defined as po(G) = min{k | there exists a 
partition of V into k subsets, V =, V,, such that (V,) is a linear forest for 
1<i<k}. 

In the context, G shows that G ~ G, i.e. G is homomorphic to G. Throughout 
this paper, all the graphs are simple, and we assume that K>,; is not isomorphic to 
K,—e for e € E(K,). 

In this paper, it will be proved that po(G) <4 if G is a planar graph. 


2. Main results 
Lemma 1. G is a planar graph with order n. 
If |{v | ve V(G) and d(v) =n — 1}|>2, then po(G) <2. 
Proof. Take v,,v2.EV(G) with d(v,)=d(v2)=n—1, then it is obvious that 
d(v) <4 for ve V\{v,, v2}, (V\{v1, v2}) does not contain cyclics for otherwise 


(V(C) U {u,, v2}) would be homomorphic to Ks. So, ({v,,v2}) and (V\ 
{v,, v2}) are linear forests. O 


Lemma 2. Let G be a maximal planar graph and |V|>3, then d(v) 23 Wu eV. 


Proof. It is obvious, and therefore is omitted here. 
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Lemma 3. Let G be a maximal planar graph with |Vj|\=n>6, then 
max,¢y d(v) =5. 


Proof. By Euler’s formula we can obtain this conclusion. 

It can be proved that if G is a planar graph with |V| =n <8 then p,(G) <2. 
However, there exists a planar graph with n=9, its point-linear arboricity is 
larger than 2. See the diagram in Fig. 1. 


Fig. 1. 
Theorem 2. If G is an outplanar graph, then po(G) <2. 


Broerer and Mynhardt [3] have given a proof of the theorem. Here we give 
another proof. 


Proof. By Theorem 11, 12 in [2], G contains neither K, nor K2,;. Without losing 
the generality, we assume G is connected and |V| 2 4. 
If d(v) <2 for Vu € V, then the conclusion of Theorem 2 obviously holds. 


We now assume that there exists a point vyéV such that d(vy) 23. Let 
Ao = {Uo}, A, =N(vo) where N(u) = {uv | (u,v) € E}, Ais, = N(A,)\{A; VA,-1} 
for i121, where N(A) = Uyeu N(v). 

As V is limited, there exists an integer ¢ such that A,#§ and A,,,=9. 
Furthermore, we obtain a partition of V: 


1=0 


We are prior to prove that Vie {1,2,...,¢}, (A;) is a linear forest. Otherwise, 
there exists an A, such that (A,) contains either cyclics or K,,3. Let us assume 
that (A,) contains a cyclic C with |V(C)| =3 and take v,, v2, v3e V(C). By the 
definition of partition we know that there exists a point a €\i=),A,, such that 
Va e {1, 2, 3} there exists a (a—v.) path P, in (jo A;) which satisfies the 
following properties: 


V(P.)OV(Ps) = {a} if a #8. 
V(P.)NA,= {va} if =1, 2,3. 
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Thus, C, P,, P, and P; form a subgraph which is homomorphic to K,. A 
contradiction is obtained. 

If (A;) contains K,, with the similar method, we can prove that G contains 
K,,;. The same contradiction is obtained. 

Let 


{0/2] 

A= U Ax 
k=0 
{¢+1/2] 

B= U Ax-1 
k=1 


By definition we know E(A;, A;)=9 if |i—j|2=2. So (A) and (B) are all 
linear forests. Hence po(G) <2. O 


In the following context, G is a simple planar graph, V =\J;-, A; denotes a 
partion of V which can be obtained, similar to the proof of Theorem 2, where ¢ is 
determined by a given graph. 


Lemma 4. Vie {1,2,...,¢}, (A;) is an outplanar. 


Proof. Otherwise, there is i such that (A,;) contains either K, or K,3. Let us 
assume that (A;) contains a subgraph H = K, and denote the four points which 
are not subdivisible v,, v2, U3, Uy. If i=1 then (V(H) U {uv 9}) homomorphic to 
K, is a contradiction. So, i>1, by the definition of partion V = j-, A;, one of 
following two cases will arise: 


Case 1. There exists a point ae >A, such that (Ui-9A,;) contains 
(a, v,)-path P, for a = 1, 2, 3, 4, which satisfies the following properties: 
V(P,)N V(A) = {V.} for =1, 2, 3,4 
V(Px)OV(Ps) = {a} if #8. 


Case 2. There exist two different points a and b €\_i,—) A; such that there exist 
two points, without losing the generality we assume v,, U2, such that (Uj=04;) 
contains (a, v,)-path P, for =1,2. Meanwhile, there exists (b, ug)-path P, for 
B =3, 4 and (a, b)-path P,,,. These paths satisfy the following properties: 

V(P,)N V(P:) = {a} 

V(P3) 0 V(P,) = {5} 

V(Px) OV (Pg) =9 a=1,2, B=3,4. 
V(Pa,6) O V(Ps) = {a} a=1,2 

V(P.6) V(Ps) = {5} B=3,4 
V(P,)OV(H)={V.}  w=1,2, 3,4 

V(P,.2)N V(A) =9. 
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For Case 1, HUP,UP,UP,UP, is homomorphic to Ks. For Case 2, 
HUP,UP,UP,UP,UP,,, contains K, . These two cases result in contradiction. 
So, (A;) does not contain K,. 

Similarly, we can prove that (A,) does not contain K,3. O 


Theorem 3. Let G be a planar graph, then p.(G) <4. 


Proof. We can obtain the conclusion immediately from Theorem 2 and Lemma 
4.0 


Corollary 1. Let G be a planar graph with |V(G)|=n. If there exists a point 
uv eV, such that d(v) =n — 3, then p,(G) <3. 


Proof. It is obtained immediately from Lemma 4. 0 
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This work presents a simple proof of the Littlewood—Richardson rule on multiplying Schur 
functions using nonintersecting paths and a characterization of Schur functions by this rule. 


1. Introduction 


Since the work [11] of MacMahon was published, many papers on com- 
binatorial structure in symmetric groups followed, among them, [1-4, 7, 9, 12, 
14] and [17-21]. The algorithmic structure of the expansion of the product of 
several Schur functions was discussed in 1934 in the paper [9] of Littlewood and 
Richardson. The algorithm that appeared in this paper is called the Littlewood- 
Richardson rule. A good deal of effort has gone into validating this algorithm. 
The first attempt was by Robinson [12], but his proof was not satisfactory as is 
shown in MacDonald [10], p. 73 and Remmel—Whitney [14], p. 485. By applying 
the Robinson-Schensted—Knuth algorithm, White [21], Hillman—Grassel [6], 
Thomas [19] and Remmel—Whitney [14] recently gave some proofs of this rule. 

The Littlewood—Richardson rule in [9] for multiplying Schur functions S,(x) 
and §, (x) states that 


S(X)S.x)= Dd ch, Sux), (1) 


JUJ=lAl +p} 
where 


ci, = number of tableaux of shape u/A such that the word 


is a lattice permutation and the weight is z. (2) 


Since the Schur function S,(x) is determined only by the shape A of the Young 
tableau, and each Young tableau uniquely corresponds to a line of stones by Sato 
[15, 16], in this paper the Schur function is treated as a line of stones. Then, the 
rules (1) and (2) of the multiplying Schur functions give a random walk of the 
stones as nonintersecting paths (as for this definition, see [5] and as for the recent 
works, see [4] and [8]). Thus, the Schur function is grasped as an operator to 
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ie) tie Jas re 
-1 


-3 


Fig. 1. 


bring about a motion of the stones. As a result, the Jacobi-Trudy identity of the 
Schur function (cf. [10] p. 25) is shown. 


2. Nonintersecting paths in the construction of Young tableaux 


In this section, we discuss the correspondence between Young tableaux and 
arrangements of stones on the integer line (cf. [15] and [16]). In addition, we give 
a graph representation of the motion of these stones as nonintersecting paths. 

We arrange all Young tableaux on the left corner in the product quarter space 
Z, xX Z_ demonstrated in Fig. 1. Then the correspondence between Young 
tableaux and the arrangements of stones is given as follows (see Fig. 2), 

(1) For the empty Young tableau, the corresponding line of stones is 

constructed by placing stones on all negative integers in the line, 

(2) For the Young tableau with shape (A,,0,...,0,...), the corresponding 

line of stones is constructed by moving up the stone on —1 of case (1) to 
the position A, — 1, 


(1) (2) (3) (4) 


Fig. 2. 
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<M fu) ]tuy] --- {u,_, lth! 


Fig. 3. 


(3) For the Young tableau with shape (A,, 42, 0,...,0,...), the correspond- 
ing line of stones is constructed by moving up the stone on —2 of case (2) 
to the position A, — 2. 

Thus, proceeding in this manner, we obtain the lines of stones in the Young 

tableau with general shape (A,, A2,...,0,...,0,...); 

(4) The stones on the line are placed on the positions denoted by integers 

{A,—1,4,.-2,...,a, —n, —n—1, -n—-2,...}. 

To represent the line of stones in the general case (4), we use the notation (A| 
for A=(A,,...,4,,0,...,0,...). 

We now define the raising operator [fo], Uy € Z, on the space of the line of 
stones {(A|;A=(A,,...,4,,0,...), A, 2A, B+: BA, > 0}. 

Move each stone in the state (A| up from the lower part of the line of stones 
such that no stone displaces any stone above it and the total sum of the 
movements is equal to #o. The movements are denoted by the raising operator 
[uo] and the generated states are denoted by (A| [Mo]. The application of the 


raising operators [,],...,[4n], in order, leaves tracks which give a set of 
nonintersecting paths with m steps in the direction of the right (See Fig. 3). We 
denote the set of nonintersecting paths by (A|[,],..., [Hnl- 


3. Nonintersecting path as words 


In this section, we discuss the connection between words and nonintersecting 
paths, and we give a transformation rule for paths that preserves the property of 
nonintersection. 

Let L be a nonintersecting path in the set (A|[u,]---[#,]. We denote the 
steps in L generated by the operator [,] by the integer k. Let us arrange these 
integers in order from the right hand side of the topmost path in L. The sequence 
of these integers is called the word w(L) of L. An example is given in Fig. 4. 
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| w(L) = 2211332. 


Fig. 4. 


We assume the positive integers ,,...,/,, associated with the raising 
operators [,], ..., [Hm] are in the decreasing order, pf, =u, >-+-* > Um. 

Let us define the index /,(L) of the word w(L) with respect to i and j. Let 
w(L,) denote the subword in w(L) generated by the first k steps. Then the index 
[,(L,) is defined as follows: 


[,(L,) = {the number of occurrences of the letter i in the word w(L,)} 


— {the number of occurrences of the letter j in the word w(L,)}. 


Let the index /,(L) with respect to i and j be the minimum of [,(L,) with respect 
to k: 


[,(L)= min -1,(Lx)- 


For i<j, the word w(L) is called an (i, j)-lattice permutation if and only if the 
index [,(L) is a non-negative integer. If the word w(L) is an (i, j)-lattice 
permutation for any i<j, the path L is called a lattice permutation. 

Let us state a rule given by Robinson [13], p. 46 for transforming non-(i, i + 1)- 
lattice permutation paths in (A|[H,]...[H,] onto (Al [mi]... [wi + Ulmer - 
1]... [u,,]. We call it Robinson’s recomposition rule with respect to (i, i +1). It 
is composed of two parts as follows: 

(1) If the index /,,,,(L) for L in (A| [ui]... [u,,] is a negative integer, the 

letter i+ 1 in the first position k for which J, ;,,(L) = [,;.,(L,) is changed 
into 1. 

(2) Let the path L be in (A\[u,]... [ui + I[uir.—1)--- [Hn]. If the index 
Lai(L) is a nonpositive integer, the letter i next to the last position &k for which 
1 j4(L)=1,,+:(L,) is changed into i+1. Otherwise, the letter i in the lead 
position is changed into i + 1. 


Proposition 1. Robinson's recomposition rules (1) and (2) preserve the property of 
nonintersection. 


Let the set of all nonintersecting paths which are lattice permutations in 
(Al [ti]... [Hn] be denoted by (Al [H1,..., Hm]. We give a reduction formula 
which transforms nonlattice permutation paths into lattice permutation paths. 
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Proposition 2. Let L be a nonintersecting path in (A\[U,,..., Mm~il[Um] with 
negative index I,,_, m(L) = —k <0. 


Then, Robinson's recomposition rule (1) gives a unique path in (A\[u,+ 
1)... 5 m1 + Um—1) lm — k] for some a,€ Z, i=1,...,m—1and |a| =k. 
Conversely, for any path L in (A\[ui+@1,.--5 matt m1 lm — k] with 
a,€Z,,i=1,...,m—1 and |a|=k, Robinson’s recomposition rule (2) gives a 
unique path L in (A|[ty, ~~. Um—il[Mm] with negative index I,_\ (L)= —k <0. 
As a result, we have the following operator identity: 


(itp ces SIDS Uy Hees eet Ee RS 


k=0 k=|a| 


4. Main result 
Using the identity (1) repeatedly, we obtain the following result. 


Theorem. In the path correspondence of Proposition 2, we have the following 
operator identities: 


m-1 
[Hi Mm = 2 (1 > [H1,---+ Hi, +1,..., 


iy<s<i<m-1 


Hi, + Ly yd Lm—1][Hm —k], (2) 
[H4, see Lm] = > e(o)[H, + o(1) ) 1] se [Hn + o(m) ~ m]. (3) 


Remark. Since there is a 1-1 correspondence between the Schur functions and 
the arrangement of stones, by applying the identity (3) to the state (0| of the 
empty Young tableau, we have the well known Jacobi—Trudy identity. 
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This paper is a survey of results on chromatic polynomials of graphs which are generaliza- 
tions of trees. In particular, chromatic polynomials of q-trees will be discussed. The smallest 
q-tree (q > 1) is the complete graph K, on q vertices. A q-tree on n + 1 vertices where n = q, is 
obtained by adding a new vertex adjacent to each of q arbitrarily selected, mutually adjacent 
vertices in a q-tree on n vertices. Another generalization of trees is the n-gon-trees. The 
smallest n-gon-tree (n = 3) is the n-gon which is a cycle on n vertices. A n-gon-tree with k + 1 
n-gons is obtained from a n-gon-tree with k n-gons by adding a new n-gon which has exactly 
one edge in common with any n-gon of a n-gon-tree with k n-gons. 


1. Introduction 


The graphs which we consider here are finite, undirected, simple and loopless. 
Let P(G, A) be the chromatic polynomial of the graph G. A graph G is uniquely 
m-colorable if and only if p(G, m) = m!. Two graphs are said to be chromatically 
equivalent if their chromatic polynomials are equal. 

In [3], Chao and Whitehead proved that a large class of nonisomorphic 
connected graphs are chromatically equivalent. This class includes the cactus 
graphs; a cactus graph is a connected graph where each pair of cycles have no 
edge in common. In a planar graph, a cycle is a mini-cycle if and only if it is one 
of the two smaller cycles in every @-subgraph. A @-subgraph is a subgraph which 
consists of two cycles with exactly one common edge. A connected planar graph F 
is forest-like if each pair of cycles have at most one edge in common, and if the 
dual graph F¢ of F is a forest where F4 is obtained from F by replacing each 
mini-cycle by a vertex, and by joining two vertices in F* by an edge if and only if 
the corresponding mini-cycles in F have one edge in common. Chao and 
Whitehead proved that all forest-like connected planar graphs with n vertices, e 
edges and the same number of mini-cycles of each length are chromatically 
equivalent. 


2. q-trees 


The graphs called 2-trees are defined by recursion. The smallest 2-tree is the 
complete graph K, on 2 vertices. A 2-tree on n+1 vertices (where n= 2) is 
obtained by adding a new vertex adjacent to each of 2 arbitrarily selected 
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adjacent vertices in a 2-tree on 7” vertices. (It should be noted that 2-trees are 
examples of forest-like connected planar graphs where the mini-cycles are 
triangles. A triangle is a cycle of length 3.) In [5S], Skupieri stated a problem to 
find a characterization of 2-trees. In [6], Whitehead proved that a graph G is a 
2-tree on n vertices if and only if P(G,A)=A(A-1)(A-2)""?. The proof 
technique is based on the fact that any n-vertex graph G having chromatic 
polynomial A(A — 1)(A —2)"~? is a uniquely 3-colorable graph. The method of 
proof is similar to the method used by Xu and Li, in [7], to prove that a wheel on 
an odd number of vertices is chromatically unique. 

The graphs called q-trees are defined by recursion. The smallest q-tree is the 
complete graph K, on q vertices. A q-tree on n+1 vertices (where n >q) is 
obtained by adding a new vertex adjacent to each of q arbitrarily selected, 
mutually adjacent vertices of a q-tree on 7 vertices. In [2], Chao et al. proved 
that a n-vertex graph G is a q-tree if and only if P(G, A)=A(A-1)---(A-—q+t 
1)(A — q)”? where n = q. In their proof, they use the fact that any graph which is 
chromatically equivalent to a q-tree is a uniquely (q + 1)-colorable graph. They 
obtain a sharp upper bound estimate on the number of triangles in such a graph. 
They show that q-trees are the only graphs which obtain this upperbound. Every 
(q + 1)-coloring of a uniquely (q + 1)-colorable graph results in the same partition 
of its vertices into color classes. Chao et al. study the subgraphs induced by the 
union of any & of these color classes where 2<k <q +1. 


3. Trees of polygons 


In [1], Chao and Li studied chromatic polynomials of “trees of polygons’’. Let 
n be a positive integer where n >3. The graphs called n-gon-trees are defined by 
recursion. The smallest n-gon-tree is the n-cycle, denoted C,,, which is the only 
two-connected graph containing n vertices and n edges. A n-gon-tree with k + 1 
n-gons is obtained from a n-gon-tree with k n-gons by adding a new n-gon which 
has one edge in common with any n-gon of a n-gon-tree with k n-gons. Fig. 1 
gives examples of a 3-gon-tree with six 3-gons, and a 4-gon-tree with five 4-gons. 
(In [6], Whitehead studied 3-gon-trees calling them 2-trees.) Chao and Li proved 
the following theorem which characterizes n-gon-trees. 


Fig. 1. 
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Theorem. A graph G is a n-gon-tree with k n-gons (where integer n=3 and 
integer k2=1) if and only if P(G,A)=A(A-1)(Q(C,,, A))* where Q(C,, A) = 
P(C,, A) + (A(A- 1)). 


Their proof of this theorem is based on Kuratowski’s characterization of 
non-planar graphs in terms of K; and K33 homeomorphs, Chao and Zhao’s 
characterization (see [4]) of two-connected graphs containing no subgraphs 
homeomorphic to K,, and a careful analysis of the coefficients of the chromatic 
polynomial of a n-gon-tree. 


4. Unsolved problems 


Problem 1. For each positive integer m, classify the uniquely m-colorable graphs. 
These graphs should yield new families of chromatically unique graphs. 


Problem 2. Find a characterization of arbitrary trees of polygons similar to the 
characterization of n-gon-trees given by Chao and Li in [1]. By arbitrary trees of 
polygons, we mean trees of polygons where the length of the polygons are not 
required to be the same. 
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1. Introduction 


Suppose G,, G2,..., G, are graphs of order at most n. We say that there is a 
packing of G,, G),..., G, into the complete graph K,, if there exist injections 
a@;:V(G)—> V(K,), i=1,2,...,k such that a*(E(G))Nas(E(G,)) = for 
i#j, where the map a}: E(G;)— E(K,) is induced by a;. Similarly, suppose G is 
a graph of order m and H is a graph of order n > m and there exists an injection 
a:V(G)—V(H) such that a*(E(G))N E(H) = ¢, then we say that there is a 
packing of G into H, and in case n =m, we also say that there is a packing of G 
and H, or G and H are packable. Thus G can be packed into H if and only if G is 
embeddable in the complement H of H. However, there is a slight difference 
between embedding and packing. In the study of embedding of a graph into 
another graph, usually at least one of the two graphs is fixed whereas in the study 
of packing of two graphs very often both the graphs are arbitrarily chosen from 
certain classes of graphs. 

In this paper we shall concentrate mainly on some results on the following two 
packing problems. The first one is on dense packings of trees of different sizes 
into K,. The second one is on general packings of two graphs having small size. 
Some open problems in this area will also be mentioned. 

We shall use the following notation and definitions. The order, the size and the 
maximum valency of a graph G are denoted by |G|, e(G) and A(G) respectively. 
A tree, a star and a path of order i are denoted respectively by 7;, S; and P,. The 
tree obtained from S,_, (n =5) by inserting a new vertex on an edge is denoted 
by S,. The tree obtained from S,,_,(n = 6) by inserting a new vertex on the edge 
which is not incident with the vertex of maximum valency of S;,_, is denoted by 
S’,. We denote by $(6) the tree obtained from S, by adding two new vertices each 
of which is joined to one end-vertex of S,. The cycle of length m and the null 
graph of order r are denoted by C,, and 0, respectively. The complete bipartite 
graph having bipartition V, and V, such that |V,| =m and |V,|=7 is denoted by 
Kmn- The graph obtained from C, by adding an edge joining two opposite 
vertices is denoted by C;. The disjoint union of two graphs G and H is denoted 
by GUH, and mG stands for the disjoint union of m copies of G. A graph of 
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order n and size m is called an (n, m) graph. If a is a packing of G into H, then 
the set of edges of H incident with at least one vertex of a(x), x € V(G), is said 
to be covered by G. 

Before we give our survey, let us point out that it has been shown that to find 
an efficient algorithm to pack two general graphs G and H is an NP-hard problem 
(see Garey and Johnson [15; p. 64]). However, if one restricts G and H to some 
special kind of graphs, for instance, both G and H are trees, then there exist 
polynomial time algorithms for the packing of G and H (mentioned in 
Hedetniemi et al. [17]). 


2. Packing n—1 trees of different size into K, 


Erdos and Gallai [11] proved that every graph G having size e(G) > 4 |G| (k - 
1) contains a path of length k. In 1963, Erdés and Sdés made the following 
conjecture. 


Conjecture (Erdés and Sés). If G is a graph and e(G)>4|G|(k—1), then G 
contains every tree T of size k. 


Motivated by the above conjecture, Gydrfds raised the question whether any 
sequence of trees T,, 73,..., 7, can be packed into K,. Gyarfds’ question has 
now been quoted in many books and research papers as the Tree Packing 
Conjecture (TPC). 


Tree Packing Conjecture. Any sequence of trees T,, T3,...,T, can be packed 
into K,,. 


Till now, the TPC is still open. However, the TPC has been proved valid for 
guite a few cases. We list some of the results below. 


Theorem 2.1 (Gyarfas and Lehel [16]). Any sequence of trees T,, T;,..., T,, in 
which all but two are stars, can be packed into K,,. 


Theorem 2.2 (Gy4rfaés and Lehel [16]). Any sequence of trees T, Ts,..., Ty 
where T, € {S;, P,}, can be packed into K,,. 


Theorem 2.3 (Hobbs and Kariraj; see Hobbs [18]). Suppose each T,, 2<i <n, 
with at most one exception, has diameter at most three. Then T,, T;,..., T, can be 
packed into K,,. 


Theorem 2.4 (Straight [26]). Suppose for each T,, 2<i<n, with at most two 
exceptions, A(T;)2=i-2. Then T,, T3,..., T, can be packed into K,,. 
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Theorem 2.5 (Straight [26]). Suppose for each T,, 2<i<n, with at most one 
exception, A(T;)2i-—3. Then T,, Tj, ... , T,, can be packed into K,,. 


Theorem 2.6 (Straight [26]). Suppose L, is a tree of order i shown in Fig. 1. Then 
any sequence of trees T,,T;,...,T7,, where T,€ {S,, P, L;}, can be packed 


Beoeaneae 


e 
L., i even L,, i odd 
i i 


Fig. 1. 


Theorem 2.7 (Fishburn [14]). Jf each T; is a double star, unimodal triple star, 
interior-3 caterpillar or thin-body spider, then T,, T3,..., 1, can be packed 
into K,,. 


(For definitions of the terms used in this theorem, see Yap [33].) 

Straight [26] and Fishburn [14] verified that the TPC is true for all n <9. The 
proof of Theorem 2.2 originally due to Gyarfas and Lehel was very complicated. 
A very nice, short proof of Theorem 2.2 has been found by Zaks and Liu [34]. 

Since a resolution to the TPC is hard, we may look for some other aspects of a 
similar nature. The following is a result of such an attempt. 


Theorem 2.8 (Bollobds (3]). If 3<s<4V2n, then any sequence of trees 
T,, Ts,..., T, can be packed into K,,. 


The following are some results concerning dense packings of paths into 
complete bipartite graphs. 


Theorem 2.9 (Zaks and Liu [34]; Fink and Straight [13]). Any sequence of paths 
P,, Py, ..., Pa, can be packed into K,,,. 


Theorem 2.10 (Zaks and Liu [34]; Fink and Straight [13]). For odd n, the 
sequence of paths P3, Ps,..., Po»+, can be packed into K,, n+ - 


We note that the above results on dense packings of trees having different sizes 
into K, have been applied in the study of packing sequences of integers into a 
matrix. For details, see Hobbs [19]. 

Finally, we mention here another open conjecture on dense packing of trees. 


Ringel’s conjecture (Ringel [22]). For any tree T,,, there is a packing of 2n +1 
copies of T,4, into Kry41. 
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Some progress on certain problems closely related to the Ringel’s conjecture 
has been made. For details, we refer the readers to Huang and Rosa [20] and the 
references given therein. 


3. Packing two graphs of small size 


We next give a survey of the general packing of two graphs having small size. 


Theorem 3.1 (Sauer and Spencer [23]). Let G and H be graphs of order n. If 
e(G)e(H) <(5), then G and H are packable. 


The example that G=S,,, and H=mK, shows that Theorem 3.1 is best 
possible. However, if neither G nor H is a star, then perhaps, this theorem can be 
improved considerably. 

The following theorem was first announced by Catlin [9]. A proof of this 
theorem was subsequently given by Sauer and Spencer [23] (see also Bollobds 
(2; P.425]). 


Theorem 3.2. Suppose G and H are graphs of order n. If 2A(G)A(H) <n, then 
there is a packing of G and H. 


Bollobas and Eldridge [5S] pointed out that the result of Theorem 3.2 is almost 
best possible. For example, suppose d, and d, are two integers such that 
d,<d,<n and suppose n <(d, + 1)(d) + 1) —2. For i=1, 2, letn =p(d, + 1)+ 
rn, lsr<d,+1. Let G=p,Kg,,,UK,, and let H=p.Kg,,,UK,,. Then 
A(G)=d,, A(H)=4d, and there is no packing of G and H. Thus if n= 
(A(G) + 1)(A(H) + 1) — 2, then G and H may not be packable. Motivated by this 
example, they made the following conjecture. 


Conjecture (Bollobaés and Eldridge). If G and H are graphs of order n and 
(A(G) + 1)(A(H) + 1) <n 4+ 1, then there is a packing of G and H. 


We remark that by Theorem 3.2, this conjecture is true when A(G) = 1. It is 
mentioned in Bollobds [2; P. 426] that if this conjecture is true for A(G) = 2 then 
it would imply a theorem of Corrddi and Hajnal (‘On the maximal number of 
independent circuits in a graph,’ Acta Math. Acad. Sci. Hungar. 14 (1963) 
423-439) and that if this conjecture is true in general, then it would extend a 
theorem of Hajnal and Szemerédi (Proof of a conjecture of Erdés, in ‘“‘Com- 
binatorial Theory and its Applications”, Vol. II (Eds., P. Erdés, A. Renyi and 
V. T. Sés), Colloq. Math. Soc. Bolyai 4, North-Holland (1970), 601-623). For 
details, see Catlin [10]. 

We next consider the problem of packing two graphs G and H in which 
e(G) + e(H) is small. Milner and Welsh [21] noticed that if any two graphs G and 
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H of order n such that e(G) + e(H) < [3(n — 1)] are packable, then one can prove 
that the computational complexity of any graph property F has lower bound 
(3(n —1)]. They conjectured that e(G) + e(H) <[3(n — 1)] is sufficient for the 
packing of two graphs G and H. This conjecture was proved by Sauer and 
Spencer [23]. A generalization of this result is given below. 


Theorem 3.3 (The main packing theorem of Bollobds and Eldridge [5]). Suppose 
H and G are graphs of order n, A(H), A(G)<n-1, e(H)+e(G) <2n -3 and 
{H, G} is not one of the following pairs: {2K,,0,UK3}, {0,UK3, K,UK3}, 
{3Kz,0,UK,}, {0;UK3,2K3}, {2K,UK3,0;UK,}, {0,UK,4, K2U2K3} and 
{0;U K,, 3K3}. Then there is a packing of G and H. 


The proof of the main packing theorem of Bollobas and Eldridge depends 
heavily on Lemma 3.4 given below. An alternative proof of Lemma 3.4 and a 
slight simplification of the original proof of Theorem 3.3 can be found in Teo [28] 
(see also Yap [33]). 


Lemma 3.4 (Bollobas and Eldridge [5]). Let T be a tree of order p and let G bea 
graph of order n. Suppose 2<2p <n (n=5), A(G)<n-1 andn-1<e(G)s 
n+ (n/p) —3. Then there is a packing of T and G such that T covers at least p + 1 
edges of Gand A(G -T)<n-p-1. 


The main packing theorem of Bollobas and Eldridge extends an earlier result 
of Burns and Schuster [7] (who proved that every (n, n—2) graph can be 
embedded in itself). This theorem, in turn, has been extended to the case that 
e(H) + e(G) <2n — 2 where n = |H| =|G|. The extension is carried out in several 
steps. A preliminary step is to show that any tree T of order n =5 can be packed 
into an (n, n — 1) graph G. This result (Theorem 3.5) is due to Slater et al. [25]. 
It generalizes an earlier result of Hedetniemi et al. [17]. The original proof of 
Theorem 3.5 uses induction on n by deleting two vertices from both T and G, and 
as a result many cases (when n = 5, 6) have to be verified. An alternative proof of 
Theorem 3.5, using induction on n by deleting only one vertex from both T and 
G, was later given by Teo [28] (see also Yap [33]). 


Theorem 3.5 (Slater, Teo and Yap [25]). Let T be a tree of ordern=5 and let G 
be an (n, n—1) graph. If neither T nor G is a star, then there is a packing of T 
and G. 


An extension of Theorem 3.5 were independently obtained by Schuster [24] as 
well as Teo and Yap [29]. The generalization is as follows. 


Theorem 3.6 (Schuster [24]; Teo and Yap [29]). Suppose T is a tree of ordern>5 
and G is an (n,n) graph such that A(T), A(G)<n—1. If {T, G} #{P;,0,U 
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Ci}, (Ps, 0. U Ky}, {S(6), 0,U K,}, or if G = UC; where i> 4 for at least one i 
and T #S; or if G=kC; and T #S,, or Sj, then there is a packing of T and G. 


Our next step to extend the main packing theorem of Bollobds and Eldridge is 
to find the forbidden pairs for the packing of two (n,n —1) graphs G and H 
(Theorem 3.7). For n =5, there are thirteen forbidden pairs. We shall adopt the 
notation shown in Fig. 2. 


re) ° re) .e) 
G(5): H5: if G(8): O i, 
re) 
Fig. 2. 


Theorem 3.7 (Teo and Yap [29]). Suppose G and H are two (n,n —1) graphs, 
n=5, which are not stars. If {G, H} is not one of the following thirteen pairs: 

(1) {RUK3, P,UK3}, (2) (0,UCs, 0,UC.}, 3) {G(5), RUC}, (4) 
{0,U Ky, P,;UCs}, (S) {03 U Ky, PxUK3}, (6) {03U Kg, Hp}, (7) {0,U2Ks, 
S,U K3}, (8) {0,U2K3, 0;3UK,4}, (9) {0,U2K3, 0,U2K3}, (10) {G(8), BU 
2K3}, (11) {0Q.UP,UK,, P,U2K3}, (12) {OQ,UKs, P,U3K3}, and (13) {K3U 
S,-3, KzUS,-3}, n =8, then there is a packing of G and H. 


We note that Theorem 3.7 generalizes an earlier result of Burns and Schuster 
(8] (who gave a complete characterization of packing an (n, n — 1) graph, n>4, 
with itself) and Theorem 3.5. 

The forbidden pairs {G, H} for packing two graphs G and H of order n for 
which e(G)+e(H)<2n-—2 have been completely determined. Besides the 
forbidden pairs given in Theorems 3.3 and 3.7, there are thirty more such pairs. 
Because there are so many forbidden pairs, for simplicity, we shall state the result 
only for n =9. We require the notation shown in Fig. 3. 


G(9): 
oO o fp 


Theorem 3.8 (Teo and Yap [30]). Suppose G and H are graphs of order n=9 
such that A(G), A(H)<n—1 and e(G) + e(H) <2n — 2. If {G, H} is not one of 
the following thirteen pairs: 

{3K3,05U K4}, {3K,U K3,0,U Ks}, {0s UK,, K,UK,;UK,}, {05U Ky, Ho}, 
{G(9), 3K3}, {0;U K,UK,, 3K3}, {0,UK4,2K;UK,}, {2K,U2K3, 0;U Ks}, 


Fig. 3. 
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{0.U Ks, K,U3K3}, {0,U Ks, 4K3}, {0,US,-1, UC}, {K,US,_2, kC3} and 
{K;US,_3, K3US,-3}, then G and H are packable. 


Finally, we mention two results on packing two graphs G and H of sufficiently 
large order. Let t,_,(n) be the size of the Turan graph 7,(n). Then ¢,_,(n) is 
approximately equal to 3[{1 — 1/(s — 1)]n?. 


Theorem 3.9 (Bollobds and Eldridge [5]; see also Bollobas [2; P.426]). Let m and 
s be positive integers such that 


()=m<("5") 

2) - ae 

Let G and H be graphs of order n, e(G) =m and 
e(H)<(5)-4-1(n)-1 

If nis sufficiently large, then there is a packing of G and H. 


Note that the example G = K, UO,_, and H = T,_,(n) shows that Theorem 3.9 
is best possible. 


Theorem 3.10 (Bollobds and Eldridge [5]; see also Bollobas [2; P.427]). Suppose 
0<a<}. If G and H are graphs of order n, e(G) <an, e(H) <3(1-2a)n™, 
and n is sufficiently large, then there is a packing of G and H. 


Note that the example G = K,,,U0,_,-, and H =(¢K,,, shows that Theorem 
3.10 is near to being best possible. 

Further results, conjectures and open problems on the packing of two graphs 
can be found in Bollobas (2; 436-437], and Bollobds and Eldridge [4, 5, 6]. 


4, Miscellanea 


In this section we mention a few results on some other packing problems. 


4.1 Packing a graph with itself 


Burns and Schuster [7] proved that every (n, n — 2) graph can be packed with 
itself. They also found all the forbidden (n,n —- 1) graphs G which cannot be 
packed with itself [8]. We have noted earlier that these results follow immediately 
from Theorems 3.3 and 3.7. In [12], Faudree et al. gave a complete characteriza- 
tion of packing (n, n) graphs (n >= 5) with itself. They also proved that if G is a 
graph of order n such that e(G) < $n —2, G#S, and G contains no cycles of 
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length 3 or 4, then G can be packed with itself. Motivated by this result, they 
made the following conjecture. 


Conjecture (Faudree, Rousseau, Schelp and Schuster [12]). Each graph which is 
not a star and contains no cycles of length 3 or 4 as subgraphs is embeddable in its 
complement. 


4.2 Packing digraphs 


Benhocine et al. [1] as well as Wojda and Wozniak [32] studied the packings of 
digraphs (directed graphs). Two digraphs D, =(V,, A,) and D,= (V2, Az) of the 
same order are packable if there exists a bijection a:V,—V, such that if 
(x, y)¢A,, then (a(x), a(y))€A2. In [1], it has been proved that if D,= 
(Vi, A,) and D, =(V,, Az) are digraphs of order n such that |A,| |A2|<n(n — 1), 
then D, and D, are packable. From this, it follows that if |A,|+|A2|<2n— 2, 
then D, and D, are packable. A complete characterization for packing two 
digraphs D, = (V,, A,) and D, = (V5, A.) of order n for which |A,| + |A2| <2n — 1 
is also obtained in [1]. In [31], Wojda made the following conjecture. 


Conjecture (Wojda [31]). Let k and n be integers satisfying 1<k<n(n—-1). 
Denote by f(n, k) the minimal number such that there exist digraphs D, = (V,, A;) 
and D,=(V2, A2) of order n, with |A,| =k and |A2|=f(n, k), for which there is 
no packing of D, and D,. Then for every m satisfying 2<m <n/2, f(n,n—m)= 
2n —[n/m]. In particular, f(n, n — 2) =2n — [n/2]. 


5. Some remarks on the Erdés-Sdés conjecture 


In Section 2 we mentioned that a result of Erdés and Gallai [11] confirms that 
the Erdés-Sés conjecture is true for T=P,,,. The assumption that e(G) > 
5|G|(k—1) implies that G has at least one vertex of valency k. Hence the 
Erd6és-S6s conjecture is also true for T = S,,,. It is not difficult to prove that the 
Erdés-S6s conjecture is also true for T = S,,,. It would be interesting to prove 
that the Erdés-Sos conjecture is true for some other types of trees. 

In a recent paper, Zhou [35] proved that the Erdés-Sés conjecture is true for 
k =|G|-—1. In fact, for the cases k=|G|—1 and k=|G|-—2, the Erdés-Sés 
conjecture follows immediately from Theorem 3.5. To show this, we first note 
that if e(g) >3u(k — 1) where v =|G|, then e(G) <4u(u —k). Thus Erdés-Sés 
conjecture can be restated as follows. 


Erdés-Sés conjecture. Suppose G is a graph of order v and T is any tree of size k. 
If e(G) <4u(u — k), then T and G are packable. 


Now if k=v—1 or k=v—2, then e(G) <v — 1. Hence, by Theorem 3.5 (we 
assume that T is not a star), if y=5, then T and G are packable. Thus Erdés-Sés 
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conjecture is true for k=v—1 and k=vu-—2. (The case that v<4 can be 
verified easily.) 

The above proof technique for the case k =v — 2 fails for other values of k. It 
would be interesting to develop a new proof technique to settle the case 
k=v-3., 

In his lecture at this conference, Szemerédi [27] mentioned that M. Ajtai, J. 
Komlés and himself have used the Uniformity Lemma to prove that if G is a 
graph and e(G) >4|G|(k—1), then G contains any tree of order (1—)k for 
small € and large k. 
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Let H be a hexagonal system. We define the Z-transformation graph Z(H) to be the graph 
where the vertices are the perfect matchings of H and where two perfect matchings are joined 
by an edge provided their symmetric difference is a hexagon of H. We prove that Z(H) is a 
connected bipartite graph if H has at least one perfect matching. Furthermore, Z(H) is either 
an elementary chain or graph with girth 4; and Z(H) — V,, is 2-connected, where V,, is the set 
of monovalency vertices in Z(H). Finally, we give those hexagonal systems whose Z- 
transformation graphs are not 2-connected. 


In the past few years several kinds of transformation graphs were introduced 
such as tree graph [1], minimum tree graph [2], perfect matching polyhedra [3], 
matroid basis graph [4], Euler tour graph [5], and so on. In such a graph a vertex 
is a special sort of subgraph of a specified graph, and two vertices are joined by 
an edge provided that they can transform to each other by some specified 
transformation. Almost all of the above mentioned transformation graphs can be 
shown as polyhedra of (0, 1). Their properties have been extensively studied [6]. 

Recently some transformation graphs were studied and proved not to be 
polyhedra of (0, 1), such as generalized directed tree graphs [7] and hierarchical 
tree of Kekulé patterns of hexagonal systems [8]. 

In this paper we introduce a new kind of transformation graph, namely 
Z-transformation graph of perfect matchings of hexagonal systems and observe 
some of its properties. Note that this kind of transformation is not polyhedra of 
(0, 1). 

A hexagonal system, also called ‘honeycomb system” or “hexanimal”’ (see e.g. 
[9]) is a finite connected plane graph with no cut-vertices in which every interior 
region is surrounded by a regular hexagon of side length 1 [10]. A perfect 
matching of a graph G is a set of disjoint edges of G covering all vertices of G. 

In the present paper we confine oursleves to those hexagonal systems which 
have at least one perfect matching. 


Definition 1. Let H be a hexagonal system with perfect matchings. The 
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Z-transformation graph Z(H) is the graph where the vertices are the perfect 
matchings of H and where two perfect matchings M, and M, are joined by an 
edge provided that their symmetric difference M, A M, is a hexagon of H. Let 
s be a hexagon of H and M, AM, =s. Then M, As =M, and M, As=M,. We 
call that M, and M, can be obtained from each other by a Z-transformation 
(see Fig. 1). 


Let 5,, 52,..., 5, be distinct hexagons of H. We use H[s,, 52, ... , 5,] to denote 
the graph induced by the edges of s,, 52,..., 5. 


Theorem 2. Let H be a hexagonal system with perfect matchings. Then Z(H) is 
bipartite. 


Proof. If Z(H) does not contain a cycle, then Z(H) is bipartite. 

Now assume that Z(H) contains a cycle C=M)M,...M,, where My= M.. 
Thus there exists a series of hexagons s,,..., 5, such that M,=M,_, 45s, for 
p=1i,...,¢. Suppose that s'’,...,s’~' and s” are distinct hexagons among 
them, and s' (i=1,..., m) appears 5(s') times in {s,,...,5,}. We now show 
that 5(s') is an even number for i=1,..., m. Note that if s’ contains an edge 
lying on the perimeter of H, then 6(s') must be even since M,= 
(...((Mo 4 s,) A s2)...) As.) = Mo. Now suppose each edge of s‘ does not lie 
on the perimeter of H. Since C is a cycle, there is no loss in generality in 
assuming that s'=s,. Let edge e €s,, and the other hexagon containing the edge 
e be s*. Then we have 5(s') + 5(s*) is even. Assume that 5(s‘) is odd. Thereby 
d(s*) is odd too. Let e* #e, e* €s*, and e* is parallel to e. Denote the other 
hexagon containing e* by s**. Then 5(s**) is odd too. Repeat this discussion, 
since H is finite, we eventually reach a hexagon s'€ {s',..., 5} such that s’ 
contains an edge lying on the perimeter of H and 6(s’) is odd. This is contrary to 
the fact that for each hexagon s in {s',... , s} containing an edge lying on the 
perimeter of H, 5(s) must be even. This contradiction shows that 5(s‘) is even. 
Therefore t = Y7, 5(s‘) is even, i.e. C is an even cycle. 


Z-transformation graphs 407 
The proof is completed. O 


Theorem 3. Let H be a hexagonal system with perfect matchings. Then Z(H) is 
connected. 


Proof. From the theorem in [8], we can deduce that any perfect matching of H is 
connected with the special perfect matching of H (called root Kekulé pattern), 
since every simultaneous rotation can be considered as a series of rotations of a 
single hexagon. So Z(H) is connected. 

In order to simplify the discussion, a hexagonal system is to be placed on a 
plane so that a pair of edges of each hexagon lie in parallel with the vertical line. 
Before continuing we review briefly some results about hexagonal systems. Let M 
be a perfect matching of a hexagonal system H. If six vertices of a hexagon s of H 
are covered by three edges of M, i.e. s is an M-alternating cycle, then s is called 
respectively, proper and improper sextet [12] (see Fig. 2). For each hexagonal 
system with perfect matchings, there exist exactly two perfect matchings, one of 
which has only proper sextets, the other of which has only improper sextets [12]. 
From this we can deduce that for each hexagonal system H, the Z-transformation 
graph Z(H) has at most two vertices of monovalency. 0 


O O 


proper sextet improper sextet 


Fig. 2. 


Let V,, be the set of monovalency vertices of Z(H). Then we have the 
following. 


Theorem 4. Let H be a hexagonal system with perfect matchings. Then Z(H) is 
either a path or a graph of girth 4 and Z(H) — V,, is 2-connected. 


Proof. Suppose Z(H) is not a path. Let Me V(Z(H)) — V,,. We first show that 
H has at least two disjoint hexagons which are M-alternating cycles. If M has 
valency greater than two in Z(H), then the above conclusion is evident. Now 
suppose that M has valency two in Z(H), i.e. there are only two M-alternating 
cycles which are hexagons of H, say s, and s2. If s, and sz are edge disjoint, then 
there is nothing to prove. The remainder case is that s, and s, have an edge in 
common (see Fig. 3). Let M,=M A 52. It is not difficult to verify that for M,, 


BIO 


Fig. 3. 
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besides s,, there is at most one hexagon, say s3, which is an M,-alternating cycle. 
In other words, M, has valency at most two in Z(H). The same is true of 
M,=M As,, i.e. there is at most one hexagon besides s,, say s3, which is an 
M,-alternating cycle. Note that 5,52, 5:53 and s,s3 are in parallel with one 
another (see Fig. 3). Repeat this discussion, since H is finite, we can find a path 
M}M._,...M,MM,...M,_,M, (r 21, t2=1) which is a component of Z(H). By 
the connectivity of Z(H), Z(H) is a path itself and this contradicts the 
hypothesis. Therefore, M has at least two disjoint hexagons which are M- 
alternating cycles. This implies that if Z(H) is not a path, for any M € Z(H) -— V,, 
M is contained in a 4-cycle of Z(H), namely MM,M,.M;3, where m,=M As,, 
M,=M, As, and M;=M, A5,, and s, and sz are edge disjoint hexagons of H 
which are M-alternating cycles. Since Z(H) is bipartite, the girth of Z(H) is 4. 

In the following we shall prove that Z(H) — V,, is 2-connected. Obviously, 
Z(H) — V,, is connected. It suffices to prove the conclusion: for any 2-path of 
Z(H), say M,M,Ms3, there is another path M,M;}... M3 joining M, and M; which 
is internally vertex disjoint with M,M.M;. In fact, if the above conclusion holds 
but Z(H) — V,, is not 2-connected, then there is a cut vertex M of Z(H) - V,,. 
Take two vertices M’' and M” from different components of Z(H)-V,,—-M 
which are adjacent to M in Z(H). Then there is no other path connecting M’ and 
M", a contradiction. 

Now let M,M.M; be a 2-path in Z(H), and M,=M,As,, M3=M,4A 5p, 
where s, is a hexagon of H which is an M,-alternating cycle, and s2 is a hexagon 
of H which is an M,-alternating cycle. If s, and s2 are edge disjoint, then M,M)M; 
is another path joining M, and M;, where M,= M, As, and Mj;= M3 As5,. If s, 
and s2 are edge joint, then by previous discussion, there is another hexagon s, 
which is M,-alternating cycle and is edge disjoint with s,. If s,; and s, are also edge 
disjoint, then there is another path P’=M,M5M3M,Msz joining M, and M;, 
where M5 =M, As3, Mj=M,4s,, Mz=M; As. and Ms=M, 453. If s3 and 
s2 have an edge in common, since the degree of M; in Z(H) is greater than 1, 
there is another hexagon s, which is an M,-alternating cycle. It is not difficult to 
check that s, and s, are edge disjoint. If s, and s, are also edge disjoint, then we 
obtain a path as before joining M, and M3. If s, and s, have an edge n in common 
(see Fig. 4), we can get another path P”=MM,M3M3M,M<M3, where M; = 
M As3, Ms=M;As,, M3=M2A54, Mg=M3A53, M5=M,A 52 and Mjy= 
Ms Qsq. The proof is complete. 0 
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A, Ce 


Fig. 5. 


By the above theorem, a Z-transformation graph with V,, =§ is 2-connected. 
We now turn our attention to investigate when V,, is non-empty. 

Let H be a hexagonal system, sp be a hexagon of H, O be the center of so. 
Draw three straight lines through O such that every line perpendicularly 
intersects a group of parallel edges of H. In fact, we obtain six half lines denoted 
as OA,, OA2, OB,, OB,, OC, and OC, (see Fig. 5). We call OA;-OB,-OC, 
(i=1, 2) as coordinate system with respect to (briefly, w.r.t) so. Evidently, a 
coordinate system OA;-OB,-OC; w.r.t. sy divides the plane into three areas 
A;OB,, B,OC;, C;OA;. 

Let OA-OB-OC be a coordinate system w.r.t. 59. For a point w lying in some 
area, say AOB, we define the coordinates of w to be the lengths of OW, and 
OWs (see Fig. 6), and denote W(OA) and W(OB), respectively. 

A characteristic graph T(H) of a hexagonal system H is defined to be the graph 
where vertex set is the set of the hexagons of H, and where two hexagons are 
joined by an edge provided they have a common edge. In fact, there is a natural 
way to draw the graph T(H) of H. We can always let the vertices of T(H) to be 
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The boundary of T(H) is 


X4 X5 Kg Xp Xo XX gXEX7XEX 3 Xq XQXqXy ‘i 
Fig. 7. 


the centers of hexagons of H, two centers O; and O; are joined by edge provided 
the corresponding hexagons have a common edge (see Fig. 7). 

Evidently, T(H) is a planar graph. We define the perimeter of T(H) to be the 
boundary of the exterior face, i.e. a close walk in which each cut edge of T(H) is 
traversed twice (see Fig. 7). 

Let OA-OB-OC be a coordinate system of H. If the boundary of T(H) lying 
in some area, say AOB, is a path W,W,... W, after deleting the edges lying in 
OA and OB and the path satisfies W,(OA) = W,(OA)=---2=W,(OA) and 
W,(OB) < W,(OB) <---<W(OB); or W,(OA) < W,(OA) <---<W,(OA) and 
W,(OB) = W,(OB) => ---=W,(OB), then we call the perimeter of T(H) to be 
monotone in area AOB. If the perimeter of T(H) is monotone in all three areas, 
then T(H) is called to be monotone w.r.t. the coordinate system OA-OB-OC 
(see Fig. 8). 

Let M be a perfect matching of H. M is called to be 3-dividable w.r.t. the 
coordinate system OA-OB-OC provided any edge of M does not intersect the 
lines OA, OB and OC, and two edges of M lie in the same area iff they are 
parallel (see Fig. 9). 


Fig. 8. Fig. 9. 
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Lemma 5. Let H be a hexagonal system, v be a vertex lying on the perimeter of H. 
If H or H-—v has a perfect matching M, then there is a hexagon which is an 
M.-alternating cycle. 


Proof. by Lemma 2.1 in [11] or Lemma 2 in [13], the lemm is immediate. 0 


The following theorem will describe those hexagonal system whose Z- 
transformation graphs have one monovalecy vertex. 


Theorem 6. Let H be a hexagonal system. The following three statements are 
equivalent. 

(i) Z(H) has one monovalency vertex. 

(ii) There exist a hexagon so and a coordinate system OA-OB-OC w.r.t. So, 
and a perfect matching M of H such that M is 3-dividable w.r.t. the 
coordinate system OA-OB-OC. 

(iii) There exist a hexagon so and a coordinate system OA-OB-OC w.r.t. so 
such that the perimeter of T(H) is monotone w.r.t. the coordinate system 
OA-OB-OC. 


Proof. (ii) > (i) is evident. 

()>(H). 

Let M be a monovalency vertex of Z(H), so be the only hexagon which is a 
M-alternating cycle. Let OA-OB-OC be the coordinate system w.r.t. sy such 
that each of the three edges e,, e, and e; of MM 5p does not intersect the lines 
OA, OB and OC. 

Suppose that M is not 3-dividable w.r.t. OA-OB-OC. Then in at least one 
area, say AOB, there is an edge e € M which is not parallel to e, (see Fig. 10). 
Since M is a monovalence vertex of Z(H), it is not difficult to see that H has a 
series of hexagons s,,..., 5, such that e* es, lies on the perimeter of H, and the 


Fig. 10. 
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Fig. 11. 


vertices of s; (1 <i <t) are matched by the edges of M as depicted in Fig. 10. Let 
X be the set of all the hexagons of H. If the hexagon s§ ¢ H, then the component 
H’ of H[X —{s,,...,5,}] which contains the vertex u is a hexagonal system 
satisfying the conditions in Lemma 5. Hence there is another hexagon which is 
M-alternating cycle. This contradicts that M is a monovalency vertex in Z(H). 

If s5 € H, according to e' ¢ M or e’e€ M (see Fig. 11), there are two possible 
cases, as shown in Fig. 11, where e” lies on the perimeter of H. Let H’ be the 
component of H[X — {s,,..., 5,59, 51,.--,5,}] which contains the vetex u. By 
analogous arguments as above, both cases are impossible. Thus M is 3-dividable 
w.r.t. OA~OB-OC. 

(ii) > Wii) 

By contradiction. Suppose that sy), M and OA—OB-OC mean just the same as 
in the statement (ii). But the perimeter of T(H) is not monotone w.r.t. 
OA-OB-OC. Without loss of generality, we may assume in area AOB there are 
two vertices O, and O, of T(H) lying on the perimeter of T(H) with 
O;(OA) < O,(OA) and O,(OB)<O,(OB); moreover, edges e and e’ are on the 
perimeter of H (see Fig. 12). 

We can find a series of hexagons of H s,, 5,,...,5,, (¢ 20), such that the edge 
e” is on the perimeter of H. Let the component of H[X — {s;,5,,...,5,}] 
containing the hexagon s; be H,. Evidently, H, satisfies the conditions in Lemma 5 
hence contains a hexagon which is a M-alternating cycle. This contradicts the 
statement (ii). 

(iii) > (i) 

By induction on the number of hexagons of H. 

Suppose that there is a coordinate OA-OB-OC w.r.t. a hexagon sp of H such 
that the perimeter of T(H) is monotone w.r.t. the coordinate system OA-OB- 
OC. If H contains only one hexagon, there is nothing to prove. We now suppose 
that H contains more than one hexagon. If the vertices of T(H) all lie on the axes 
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Fig. 12. 


OA, OB and OC, then it is easy to check that the statement (ii) hold. We now 
assume that there is at least one vertex v on the perimeter of T(H) not lying on 
the axes. Without loss of generality, we may assume v is in the area AOB (see 
Fig. 13), and v’(OA) = v(OA), v'(OB) <u(OB), v"(OA) <u(OA), v"(OB)= 
v(OB), where v’ and vu” lie on the perimeter of T(H) and are adjacent to v. 

Let s; be the hexagon of H whose center is v. Evidently the perimeter of 
H[X — {s;}] is still monotone w.r.t. the coordinate system OA—OB-—OC. By the 
inductive hypothesis there exists a perfect matching M’ which is 3-dividable w.r.t. 
OA-OB-OC. Therefore the perfect matcing M' Ue’ of H is 3-dividable w.r.t. 
the coordinate system OA-OB-OC. This completes our proof. 0 


In the following we shall give a more intuitive description for a hexagonal 
system H whose Z-transformation graph has a monovalency vertex. Note that for 
any hexagonal system H there exists a smallest hexagon with its edges parallel to 
the edge of H and containing the interior of H in its interior. We denote it by 
S(H). 


Fig. 13. 
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Fig. 14. 


Theorem 7. Let H be a hexagonal system. Then Z(H) contains a monovalency 
vertex iff the following three conditions are satisfied. 
(1) There are three pairwise disjoint edges of S(H) each of which contains 
exactly an edge e; of H (i =1, 2, 3). 
(2) The perpendicular bisectors of e,, ¢2 and e, intersect at the center 0 of some 
hexagon So of H. 
(3) The perimeter of T(H) is monotone w.r.t. the coordinate system OA-OB- 
OC shown in Fig. 14. 


Proof. By Theorem 6, it is easy to verify that if the conditions (1)-(3) are 
satisfied, then Z(H) has a monovalency vertex. 

Conversely, assume that Z(H) has a monovalency vertex M. By Theorem 6, 
there exist a hexagon sy of H and a coordinate system OA—OB-OC w.r.t. so such 
that the perimeter of T(H) is monotone w.r.t. OA—OB-OC. Moreover M is 
3-dividable w.r.t. OA-OB-OC. It is not difficult to see that the edge e; which is 
intersected by one of the coordinate axes OA, OB and OC and is the farthest 
from the center of sy) must be contained in an edge L; of S(H) and L; contains 
only one such edge of M. Evidently, the axis intersecting e, is the perpendicular 
and bisector of e;. 

Now the proof is completed. O 
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